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Theory of an Accurate Intermediary Orbit for 
Satellite Astronomy 
John P. Vinti 
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lhis paper derives an accurate intermediary orbit of an artificial satellite of an oblate 
i drag-free motion takes place inder the wtion of a gravitational potential 
vhich fits the even zonal harmonics exactly through the second and approximately throug! 
the | eartl This potenti ( 1s to separability of the H 
Jas 1u 
[wo alternative sets of orbital elements are set fort! he first set is related direct 
litions, but requires 1 merical factoring of certain quartic *o © 
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1. Introduction 


The author has introduced a gravitational potential | for an axially SVvinmetre plat et 
which accounts rather accurately for its oblateness and vet still leads to separability of the 
problem of satellite motion The resulting solution is expected to vield al “intermediary 
orbit” somewhat more accurate than those heretofore used, in that it accounts tor all of the 
second zonal harmonic and for more than half of the fourth zonal harmon Previous in- 
termediary orbits 2 3! have accounted for only part ol the second harmon and have neglec ted 
the fourth harmonic The residual fourth harmonic, the odd harmonics, the tesseral harmonics, 
the lunar-solar forces, and aerodynamic and ele tromagnetic drag are then to be considered as 
producing perturbations ot this Intermed ary orbit 

Izsak [4] has already given an analytical solution for this ntermediary orbit, with both 
period and secular terms correct tl rough the second order in a certain oblate ness parameter 
His solution makes rathet heavy demands on the reader's knowledge of linear fractional trans- 
formations and the theory of elliptic functions in the complex plane The present paper 


avoids these complications, with elliptic integrals occurring only in the simple forms of the 


complete first and second kinds. Furthermore the resulting solution not only gives the periodic 
terms corres thy to the second ordet but fives the sec ular terms “exactly *s Le Lo arbitrarily 
high ordet | Wish to acknowledge very explicitly however, that | ai creatly indebted to 
Izsak ior the introduction of one of the sets of orbital elements that | use Knowledge of 


this set, which permits exact factoring of a certain refractory quartic, has influenced mv treat- 


ment of the whole problem 
2. Statement of Problem 


If op. n,oare the oblate spheroidal coordinates introduced in 1} and ur, 6,0 and bf Tia YA 


are the corresponding spherical and rectangular coordinates, then 


XV+i)Y=r cos 6 exp id p c l n*)| CXp i, (2.1) 

Z rsin @ pn, ( l<nsl 29) 

Phis work ipported by the U.S. Air Force, through the Office of Scientific Research of the Air Research and Development Command 
I ck t the end of the paper 
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Here r is the geocentric distance to the satellite and @ and @ are respectively its geocentric 
declination and right ascension. For sufficiently large r, p~r and »=sin @ 

With the origin taken at the planet’s center of mass, the intermediary orbit is then the path 
of a particle in the approximate potential field 


Va up(p--+-en)'," 2.3 
where y is the product of the gravitational constant and the planetary mass and where 
e=r.J, 2.4 


Here r, is the equatorial radius of the planet and J, is the coefficient of the second zonal harmonic 
in the expansion of the planet’s true potential in spherical harmonics. For the earth .J,— (1.08) 
10~%, to three significant figures 

According to [1], the coordinates p, », @ satisfy the following equations, involving quad- 


ratures: 


t+ 6 t pF i(p)dp ie’ n?G?(n)dn, 2.5) 
By tT F (p dp a G n dn, 2.6) 
o—p Cas (p? +c? I p)dp+a l— 7’ G nidn 2.4) 


Here F(p) and G(n) are the quartic polynomials 
f n | ‘ 
F(p) C*ag-+ (p?+ c? a6 2up-+2a;p"*), =.) 
2.9 


G(n) ag l n* ag 2a\¢*n* 


The a’s and @’s are the Jacobi constants, with the energy a,<0 for satellite motion and with the 
polar component of angular momentum a,;—0 according as the orbit is direct or retrograde 
To orient oneself, note that in the limiting case c—>0 of Keplerian motion the separation constant 


a, reduces to the total angular momentum, — 8, to the time of passage through perigee, 8) to 
the argument w of perigee, and 8, to the right ascension Q of the ascending node. The Jacobi 


constants may be determined, at least in principle, from the initial conditions; we have more 
to say about this point later. The constant p, is the next-to-the-largest real zero of F(p) and 
thus is that zero of F(p) which is closest to the smaller zero of 


Tip aT 2up+2a)p*. 2.10) 


To solve (2.5) through (2.9) for p, , and ¢ as functions of ¢, we must first solve (2.5) and 


(2.6) for p and n and then substitute the results p(¢) and n(t) into (2.7) to determine é(t). Todo 
so we must first evaluate the above six integrals, which we shall obtain in terms of certain uni 
formizing variables. In turn, evaluating these integrals presupposes knowing how to factor 


the quartics F(p) and G(n). 


3. Factoring the Quartics: Orbital Elements a, e), i, 8), Bo, By 


In the case of elliptic motion (e=0) the perigee and apogee radii r; and r, would be the two 
zeros of f(p), viz, 


r=—a,(l1— ep), (3.1) 





where 


a La . 
) 
( 2a) A5u 4 
In our present problem with «+0. we mav still define constants a) and e, by (3.3) and 
4 as well as another constant 
COS Ay a 3.0 
The constants d ( 5.6 and 8 constitute on POSS hle set of orbital constant We miay also 
introduce the corresponding Seri latus reetum p defined hy 
jp) a | ( >.) 
so that 
t ip) 7) 
A determination of a a and a. would then furnish a é and 7 If the subscript + denotes 
initial values, then per unit Thass 
r / rv ( Pe! 


where / means speed and a superscript dot denotes the time derivative Also by equations 


+0 9] 13.2 10.2 nal mo) of fT] 


a | n p cn ” ' a C77 ( 1 n 35.10 
tial derivatives would determine the 


TI us a knowled re ot the nitt il coordinates ii! dl them im 


hus the constants a ‘ mad 


\ knowledge of their numerical values would then permit a numerical solution of the 


quarth equation F p 0 and tl is furnish the numer. il values of p p», A and B necessary to 


I f 2a;(p p f f p Ap B 3.11) 


where p, and ¢ ire the zeros of / ») closest to the values and Then, in the intermediary 
orbit pis restricted to the mterys il Oo, >p>9p betwee! two spheroids 


By equ iting the cor fTic ents of corresponding powe rs ol p ith ’ 1] and 2 ‘ we find 


with use of (3.3) through (3.7 


$y beginning with the zero-order solution A= B=0,  p;+po=) a pip ry AoPo, 
one can solve this set of equations for the four unknowns A, B pi +po, and pip, by a 
method of successive approximations. — If 


y O3/ Aly COs iy ? 1S) 
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the second-order solution, through £2, is 


A Qhopoy*[1+-ko( 222-3277 —4 4+ SP? ee | 3.19) 
B=kypi(l—y?)[(1+h(4y?—27) + 3.20 

Pit po= 2px” {1 —kor?y? — hp? (22? — 3.2777 —44-8y? ; (3.21) 
Pip2= pax *[1 +key? (a? —4) —h2apP (122? — 2! — 202777? — 16+ 32y? + 24? 3.208) 


The constants 


a (pi p 5.233) 
p ) 
é 3 3.24 
p p 
Pp a l ‘ 5.25 
will occur throughout the evaluation of the p-integrals. In terms of do, é), and 7», their values, 
f 
to the second order in ky, then satisfy 
p/p l- hoy? r?>—2)-+ key? (3x 22r*y? — 16x? + 242777? + 16—S2y? 3.26 
», (P= 1+-2Zkoy?(2—2?) +- hey? ( — 321+ 62/7 + 1622 — 40277 — 164-4877 3.27 
Po/T Y Y 
1—e? Soe 
l hy? (32? } } y par 4§— 2 er*y? — 202? Surry 16 \2y » DR 


l 2\3 l l a 
( ; )? 1+; hoy? (3.2? 4)-+ = key? (2024 Lj ar*y*?— SOs? +- 136277" + 64— 1447/’ 


(i e*)ip 1— as (1—e) 4 14+-4 hoy? (4—2?) +3 hi? fr*+- 7 arty? +482? 
104.277? — 64-+- 1767" > i 3.30 


In those places where e occurs alone in the theory, i.e., not in the combination 1—e?, it may 


of course be found by use of 
By (3.28) this results in 


so that when é is comparable to ky it is not feasible to expand ¢ in a power series in ky; indeed 
if é>==0, we should need a power series in kj. 

Direct use of this second-order solution in factoring Fp) will lead to p-integrals that have 
secular terms correct only to O(k3). Since we are aiming at arbitrarily high accuracy for the 
secular terms, we include it here for other purposes. The most important of these purposes 
is to furnish information about the orders of various quantities in ky; e.g., A and B are both of 
order ky. Such information will be necessary in carrying out the solution of (2.5) through (2.7 
for the periodic terms. The second purpose is to furnish a convenient starting point for any 
investigators who may choose to use do, é), and %, along with the @’s, as orbital elements, and 
who will therefore have to solve the equation /(p)=0 numerically. A third purpose is for use 
in calculating the mean motions to first order, for comparison with other theories. 

The quartic G(n), which is quadratic in 7’, may be factored either as 


G(n) = —2a,c?(n5—n?) (n> —n?) (3.33) 
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3.34 


Gin a5 —ag)n*(n *—n n N: 
If we use the latter form, we find on comparing (5 54) with (2.9) that n and ns? are the roots 


of the following quadrati equation m 


y min Pant y Pay () g 35) 
Thus 
? : ry Part a x | a 2 >.0) 
7 } r Jaye y x | Wi , 7 
where 
iH ] Sai a a ¥ Jay roe 
Krom (3.36) an 3.38) it follows that for a,< 0 
7 Ob, t a | >.) 
and we shall see below that for satellite motion >] Since 7 cannot exceed 1, it follows that 
the actual motion » always lies in the interval nox nXm between two hvperboloids 


We readily find that 
sin ] Vhyxty 1 fry vy } — ‘ 5.40 


Note that ’ . i 10) so that > 1000 


4. Factoring the Quartics: Orbital Elements a, e, J, 3, {2, 


with those ol corresponding powers 


we eq inte thre coethe Wnts ol pow rs ol 7 Ith 


-) throug! 15) we use 


Suppose we now regard a, ¢ and 


as known When we adopt a, ¢ /. and the 6's as orbital eleme nts we are certainly assuming 


so: we discuss later how thev may be determined Then in treating the p-integrals we have 
five unknowns, Viz, do. €. 7, «4, and B, and in treating the y-integrals one additional unknown, 
VIZ. 7 Altogether then, we have six unknowns and we have six equations with which to 
determine them +. ] through 1h With these orbital elements, viz, a, e, and m, however, it 
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turns out that the system can be solved exactly and with considerable ease, for the required 
unknowns. ‘This property of a, e, and n was first pointed out by Izsak [4]. 


To carry out the solution, first eliminate m from (4.1) and (4.2). The result is 
a5 —a — Cn l—n 
sin- 7 nit 1S 
a ay p 
On inserting (4.8) into (4.6), we find 
Bap C7ne [dopo +c?(1— nj) ] (4.9 


If we now use (4.3) to eliminate a) from (4.5) and (4.9) to eliminate ap» from (4.4), we find a 
pair of simultaneous linear equations for A and B: 


(ap—ec?) A—2aB 2ac*, $10 
2c*nja A+ (ap—c?n?) B=e?n?(ap—en 4.11 
Their solution is 
2ac-(1 ni) (a p—e ng 

A=— “. = 1.12 

(ap e)(\ap cn ; 1a°c-n 

> (ap Ce ap CoN, ta*¢* 
B=« l ae 1.13 


(ap—c*) (ap—e*n;) + 4a cn? 
Then, from (4.3) and (4.12) 


be a) A c*(1—n) (ap—e* ng 114 
2aa, a 2a (ap—c*) (ap—e*n; $a*c*n 
and from (4.9) and (4.13) 
ae i ap—c*)(ap—c*n;) + 4a7ec? i 
5 — =A Po c*(1—ns) +ap f ; l = : ————— (4.15) 
a} ap ¢* \ap Cn, ) ba*c?n 
Equation (4.8) and the relation m)=sin / give 
Cnn \* . 
a a ( l dP ) cos I, $.15a 
so that (4.15) and (4.15a) determine a Finally, to obtain m, combine (4.2) and (4.6) to 
obtain ny ?=c'n§ (Bap)~' and then use (4.13). The result is 
, & (ap—c*) (ap—eng 4a*c*n 
N2~ ; : : : 1.16 
ap (ap—c*)(ap—e*n,) + 4a°c’ 


This completes the solution for the required unknowns when the orbital elements are a, ¢, 
and /. In terms of these orbital elements we can now factor the two quartics F(p) and G(n 
exactly and thus evaluate all the integrals. 
With use of the old oblateness parameter ky =c?/p? and a new one, suitable for use with this 
second set of orbital elements, viz, 
k=c? p / pds, (4.17 


we can readily show that, to the first order, the equations of sections 3 and 4 give similar results. 
Thus we readily obtain 


A =—2kypo cos? 7 2kp cos* TI, (4.18) 
B =kop§ sin® ig =kp? sin? J, 4.19 
dy ° 

: 1+ko(1—e?) cos? 7 1+h(1—e?) cos? J, t.20) 
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1+2hy(1+-¢3) cos? ¢ 1-+2h4(1+-e?) cos? J, 1.2] 


P 
l é i 
i Fe ] H€,) COS* 7 l ‘ae BY, cos /, 4 929 
sin 7 ] | 
t= | é cos* 7 ee 5 e*) cos" é $95 

sin J 2 2 

n ko(1—e k(1—e $24 
No/ 22)” kl ( sin? 7 kl ( sin? J 4.25) 


With either set of orbital elements, the final solution will be given in terms of a, e, 7, 8, 
Db and B., and certain angles kk and y, analogous respectively to the eccentri anomaly, the 
true anomaly and the argument of latitude in elliptic motion Once one knows a. e. and /, 
one can then determine the 8’s by observing, at various times, whatever quantities will best 
serve to determine Kk. D, and y 

If one is using dp, é, and as elements, one has to determine them from initial values or 
Irom some procedure equivalent to determining initial values One then has to factor the 
quartic /(p) numerically to determine a, e, and mn =sin / 

If one 1S using d, @, and / us elements, one has to determine them by following the orbit 
lol Peat revolutions and then applving some sort of iterated least-square process In this 
case one can then find the Jacobi constants a, as, and ay and thus do, é), and ig by means of the 
equations of this section 

Ln any event, the determination of orbital elements by comparison of theory and ob- 
servation, is ordinarily considered a completely separate problem in celestial mechanics from 
the cal ulation of the motion for viven orbital elements We have included the above remarks 
only to aid in the possible applic ation of the present theoretical solution Indeed the problem 
is further con plicated by the small perturbations that occur in practice, 

In the rest of the paper we simply assume a, ¢, mn =sin 7, 8), B, and 8; to be known and then 
con plete the solution for the intermediary orbit. There will be one restriction, however 


e orbital inclination 


The method of evaluating the o-integrals will be found Lo depend on tl 
or / For values of 7 less than 1°54’ or greater than 178°6’, a different approach would be 


needed We shall therefore restrict considerations in the present paper to orbits that have 
inclinations between these two values, thereby ruling out equatorial or almost equatorial 
orbits 
5. The p-Integrals 
In (2.5 through (2.7 the p integrals are 

R pl [ 5.1 

R | o?-+-¢2)-'F'-4(p)dp 5.3 
where Fp s viven hy 11 A and B by $4.12) and (4.13 + by (3.25 and p, and p» by 

p a(l é p a(l é 5) $) 
Equations (5.4 follow from (3.23) and (3.24 Then 

I} 2 p 2a p po Pp» p by l Ap Bo 7 5 5) 








If, for convenience, we put 


A=-—25,, B=b?, h=b,/b. 5.6) 
and 
h=b./p (5.7 
then 
(1 Ap I Bo . ‘ ] 2rh h ; DS 
>> A"P, (A 9.9 


if |Al<l and /A/< 1. 
The expansion (5.9) will be a convenient tool for evaluating all the p-integrals, whenever 


it can be used. To see when, use (5.6), (5.7), (4.18), and (4.19 Then to O(F 


b,=kp cos? J, b.=k'p sin J, 


so that b./p<1 and 


A\=k' cos? J ese J >0 
Thus to O(k), |A!<1 whenever 
kt cos? J ese 7<1, 
or whenever 
tan? J/+tan‘* J >s 
or 
2 tan? / 1+-4k) l 
To O(k), this becomes 


tan? J >k 
But k=(r,/p)*J, and J,.=(1.08)10~$ for the earth, so that A<c 1 whenever 
tan J|>0.033 r,/p 
For close orbits |p 1, so that the necessary condition becomes 
[.<I<180°—T,, [3 1°54’ 
The expansion will thus work for all satellite orbits that are inclined more than 1°54’ to the 


equator. For those orbits that lie closer to the equator one must Use some other method to 


evaluate the p-integrals. On inserting (5.7) and (5.9) into (5.5), we find 


F(p) ’ { 2a » > b3P A)p . p p p po ; 5.10) 
With use of (5.1), (5.10), and (5.6), we find 
*p , ~ 
{ 2a,)'R, (p-t b, )[(p ~~ Pp» pO }-3 dp + y de bP X 0 my 0 0 p» p)i do 
5.1] 


Each of the separate integrals in (5.11) is a multiple-valued function of p. It is appropriate 
to change variables in each to a uniformizing variable / or vr, analogous respectively to the 
eccentric and true anomalies in elliptic motion. We define / and v by requiring them to 
satisfy 


p atl é Cos BE l é€cosr Pp 5.12) 
and always to increase with time. Then, from (5.12), (5.4), and (3.23) through (3.25), we find 


[(p P1)( po p)| 1(+dp)=dkK l—é 1+ecos vr) 'dr 5.13) 
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5.1. The Integral R 


On introducing / into the first integral in (5.11) and into each of t 


series, We find 


he integrals in the 


2a R bh ie a ik e sin iD l e* p } I h Pp "p | é COS di 5.14) 
To mnvestigute the convergence ol the series we write 
S1=>5 (b,/p)"P.) | (1 +e cos v)"*d 5.15 
h p > Te h p pP X ] € cos j ] 16 
Then 
S h p pe h p PP \ | é dD 17 
ind since [?)(X | for all A |, we have 
bo r 
S h bp > 4 LS 
h./p 5.19) 
1—b.(1 +e 
By (4.19) and (5.6), however, 6 p ki sin J and of course 1+eS2, so that 
b.(1+-e)/pS2k' sin JS 2h 5.20 
Since | 10~>°, we have 6.(1-+<« p <0.063 Thus the series S, converges, and converges more 


rapt lv than a creometric series ot common ratio 116 Actually , since we have shown that the 


series S; converges absolutely, we can regroup it into the sum of a series S,, containing only the 


even values of n and a series S containing only the odd values of n. It is then a sunple matter 
to show that S s Ss eonverges more rapidly than the geometric series |] b €)/p 
>) 501 ( p\’ That is, we can actually expect the convergence of S, to be as rapid 
as that of a geometric series of common ratio [b.(1 ¢)/p\- 1/ 1/250 


To decom post the series S, into a part proportional to and a periodic part, note first that if 


/ j | € COS di 


then 7 ) 2 T is an odd function of », « 


we obtain 


{ period 2r But f,,(22 2h», (4 


we 


To obtain, to any order in &, the parts of the p-integrals proportional to v, we shall need to con- 
sider all integral values of m in w~'f,,(4 To obtain the periodic parts correct to order 
shall need values of m only up to 4 
To obtain a convenient expression ior the v-term ot f r), note that [5 
rr cos / rP 
for all values of 2, including real values greater than unity Here P is the 


polynomial 
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of the same polynomial form in z that holds when |2|<(1, when it can be defined by the usual 
generating function. If we put 


l e- 5 25 
in (5.23), we find 
°r 
| (1+e cos v)"dv=27(1—e?)”2P,,[(1 —e?)=4] 5.26 
eR,,(V1—< 5.27 
where 
R,, (x) z"P.(1/r) Osrs1). 5.28 
Thus F,,(z) is a polynomial of degree [m/2] in x. The first fewof these polynomials are given 
in table 1. 
TABLE 1 
= Re 
2 $ 
5—3. 
| 45 —30 ; 
&) (63—70. 
t 23 ; 
From (5.22) and (5.27) we then obtain 
bd 
(1 rt é€ COs r)"dr R (v1 e* r+> > Cc, - sin 7?. 5 29) 


1 


Through m=4, the coefficients ¢,,; are easy to find, simply by binomial expansion of the integrand 
and conversion to a trigonometric polynomial. 
The results are given in the following table: 


TABLE 2 Coe fic rents ¢c,,; In » 29 


In (5.14), bJ=O(k"") and P,(A)=O0(\°) = OC} if nis even or P(A Or O(k) if nis 
odd. On inserting (5.29) into (5.14), using table 2, and keeping periodic terms through O(k*) 
only, we find 


( 2a; IR, b,E+a(Ek—e sin E A,r- >A Sin 7?, 5.30 
where 
1 (1 e*)ip >) h p)"P r R \ 1 ( } 5.3 
and 
A (I - Ph b2 p /' ‘ 5 2 
4p . 
(1 Ee" }* ‘ 
1h, 29 bie 5 > 
2 Pp 


The above proof of convergence of the series for R, also shows the rapid convergence of the 
series (5.31) for the coefficient A, of the v-term. 
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5.2. The Integral R 


On inserting (5.10) into (5.2) and using (5.12) and (5.13), we find 
2a R é p y h p Fi l e cos v) "dr 5.34) 


As above, one can show at once that this series conve reves more rapidly than the reometric series 
PD oe -olbo(1+e)/p]", with common ratio 6,(1+-¢)/p <2k!~0.066. The same proof then applies 


to the coeflicient A in 


[ sing the same methods as for ?,, we then find 


l ‘ p p> h p P \ Re \ 1 é OD 
and, through periodic terms of O(/ 
| |—e?)*p-'elb sh hb?) p bh b3(1+-e/4) p hi(4 562) p 5.37 
‘ 4/34 
| l é Pp S hy h p > é hh p > te ‘ pP ] 7.8) 
| é p = h, h p hip 5 34) 
] é p Ai 5.40) 


5.3. The Integral R, 


If in (5.3) we now use the binomial expansion 


and (5.13), we find 


and insert the expressions (5.10 


"P.(\)(1+e cos v)"***dv, (5.42) 


where the integrand Is the pro luct ot two series each ol which converges absolutely for any 


value of Then [6] it is equal to the series formed by summing the products of the individual 
terms, taken in any order, and this resulting series is itself absolutely convergent, for any value 


iniformly convergent, by the Weierstress ./-test [7], so that it may be 


ol 7 It is therefore 
integrated term by term 
Let us now rewrite (5.42) in the form 
2a R l f }’ : D l € COS / 5 43) 


Here 
dD > *d.8.°. (5.44) 


summed over all those non-negative integral values of 7 and n’ for which 








and where 
d ( 1)?(e/p)*, (5.46) 


6,°= (6. p)" P..(X). (5.47) 


Then, because of the uniform convergence, 


(—2a;)!R;=(1 e*)'p >» dD, (1+e cos v)™*?dp, (5.48) 


It is interesting to investigate here the rapidity of convergence of the series 


S > dD, l e cos r)™t2dr 5.49 
If m is even, we have m=2i and n’=2n, so that 2)+2n—2/ and j=i—n. Then 
i 
D,=D, Sod 402, so { ] "ie pP - 2" h p sP. \ 5 50 
so that 
Dy, & ( p)? } & bai 5.51 


But e=kp* and b}~kp? sin? J, by (4.19), so that 


(b/c)? =sin? J <1. 


Then 
dD, <(1+1)k (5.52) 


If m is odd, we have m=2i+1 and n’=2n+1, so that 


i 
Do 44.17 :> > d,—nOen+1 y & ( l o) | all b p)?" Ps 5.53 

Then 
D ' <(c p ih Pp : 4 ( kk l l sin / 5.54 


Then, breaking up S into an even series S, and an odd series S,, we find 


S, s p > (i +] k l e)? vs l e)*v >> ! l h l e)- 5.00 
Using 509 z'=(1—2)~' and S)Piz'=2(1—z) ~, we find > l)2'=(1—z7)~?, so that 
6 l-+-e)*2 aia 
WD ro 
ai l I l é 
Similarly 
S,| Sk4(1+e)8vsin J >) (6+) [kil +e 5 57) 
_h (1 éey’vsind - 
a ros 
l k(1 é 
Thus, 
Si< (l+¢ v1 ki (1 sin/} 5 59 


“s 1—k(1 + 


The series (5.49) for R; thus converges more rapidly than the series expansion of the funetion 


given in (5.59). 
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Through O(k*) the values of 1), are given by 


TABLE 3 
do 
di I 
d +d\d i 
/ +d / 
‘ l | +d / i , 
As with PR, and Ro, we find 
2a,)'R Agr » & A Sin 7?, 5.60) 
where 
A l e*)3p a dD R 1 e* 5 61) 


The remarks about the convergence of the series (5.49) apply also to (5.61) 
Since 2, is multiplied by c?=kp? in (2.7), we need periodic terms only through O(4), in 
order to have periodic terms in the final solution correct through O(4*) By (5.43) and tables 


2 and 3, thei coefficients are 


A |—e?)ip aE hp (3456) p (F+e) ase) | (5.62) 


e h ‘ bh? 
A l e*)?p E "3p? ( 9 +-¢* )} (5 64) 


is e' b- 


6. The »-Integrals 


In (2.5) through (2.7) the n-integrals are 


V Gt n)dn, (6.1) 
\ G nidn, 6.2) 
\ 1—n*)~'G(n dy (6.3) 


where G(n) is given by (3.34), m by (4.16), a3 by (4.14) and (4.15), and aj—aé finally by (4.8). 


i 


In evaluating N, and N, it is convenient to put 
n= sin y, (6.4) 


where Wis to be an angle that always increases with time. In the limiting case e=0 we should 
have sin y=sin@/sin J and ¥ would thus reduce to the argument of latitude, 1.e., to the angle 
between the line of nodes and the radius vector to the satellite. 

Then 


G~*(n)dn= (eb —a8)~ no(1— ¢@ sin? p)~ td, (6.5) 
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where 
d= No/ 2 O(k4) <1, 6.6) 
m being given by (4.16) 


6.1. The Integrals N, and N, 


Insertion of (6.4) and (6.5) into (6.1) gives 


Vi = (ai—ai) | a q sin’ y sin? pdy 6.7) 
Because of the identity 
(1—@¢ sin? y)~4¢* sin? y= (1—¢@ sin? y)~! 1—¢ sin? y)), 6.8) 
Ni (as— az) nog ‘| l—q*sin* ¥ idly . l—q sin? ¥ av | 6.9) 
(ai—az) Iniq”? [Fi¥.qg)—E(¥,q)], 6.10) 


where F(y,q) and F(y,q) are respectively the elliptic integrals of the first and second kinds, with 
modulus 4. 
Insertion of (6.4) and (6.5) into (6.2) gives 


N.= (a2 —ag n (l—q* sin? ¥ dy 6.11 
o3— 08) nok (y,q). 6.12 


Thus N, and N, can both be expressed exactly in terms of elliptic integrals of the first and 
second kinds. Our ultimate purpose, however, is to express each uniformising variable as the 
sum of an exact secular term and periodic terms correct through O(42). For this purpose it is 
convenient to express each elliptic integral as a linear combination of y and a Fourier series 
>> B, sin 2ny. 


To do so, consider 


Fiv,q) : l—¢g’ sin? z ly 6.13 
Some simple transformations show that 
Fiy+-r,q Fid,g~i+2K(q), 6.14) 
where 
K(q | l—q* sin? a di 6.15) 


is the complete elliptic integral of the first kind. It follows that the function F V9 
— (2/r)K(q)y¥ is periodic in y with period z. 
Furthermore it is an odd function of ¥, so that it can be expanded in a Fourier series containing 
only the sines of even multiples of y. Thus, 


F (vq) =(2/r) K (gy ¥+ D5 Fom sin 2my 6.16) 


To calculate the Fourier coefficients F,,,,, differentiate (6.16) with respect to ¥ and use (6.13 


Then 


(l—@¢ sin? y) 


(2/r) K(q)+2>5 mk m COS 2my. (6.17) 
i 
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The Fourie! coefhi ents I are then viven by 
I 2/rm l1—g* sin*® z cos 2madz 6.18 


Since cos 2mz is a polynomial in cos*® z, each F,,,, can ultimately be expressed as a linear com- 
bination of K q and the complete elliptic 1! tegral 


| q | q- sin’ 4 da 6.19 


Such a procedure howeve! would not readily reve al t he ord r ol each cor {hic went mn q; whi h 


itself is of order 4 Actually we shall show that / is of order k™, so that we shall need only 
| and / Instead we expand the radical, obtaining 
Pn)re ! 
. ~ Sill i 
q- silt J 14 S t = 6.20 
/ iad 9 7 
‘} 
) » ! ° 
/ = > RE u Sin rcos él rda 6.2 
om = 2 yn! 
Also 
9 ' 9 ' 
=~! . " a/ 
Sil 1 \%) 2 > | > ; COs (27 2 | (6.22 
ana 
cos 27 2) r] cos Zing cos >) 2m ro r] + 1 cos 7 Jn 2 j 6.25 
The integral of this product from 0 to x/2 fails to vanish only i. } m or il ! " The 
value } mis absent from (6.22), so that only the term 7=7 m in (6.22) contributes to 
the integral in (6.21 Thus 
)»)! = 
Sin COs < rd l 2 = _ = / } 6.24 
‘ 7 TL TE] é 
Qn mst ing (6.24) into (6.21 we find 
, ' ( 
/ ' . or { G& 95 
— ' ~ ' = 
} n Z } 
Inspectior ol 6.25 now shows that / () q Oi The first two values are to OI ler 


q Ol orde! / 


/ - ~ (1 7 ( )+. é 6.26) 


ai > 
/ > ° 0.24) 
20D 
kor ou purposes, therefore. 
, q » og 
F(y,q) = (2/n) K(g¥—© (14-5 @ ) sin ¥+3-5 sin 4¥+.... 6.28) 
Ss t 256 


Similar considerations about periodicity and oddness show that 


I y q q- sin? £)*da 2 /x ke q y¥ > a sin 2my 6.29) 
‘ l 
Then, as before, 
] q- sin y Imk q 25 m Eom cos 2my 6.50) 








and 


“/2 
Eam=(2/rm) | (1—g sin? xr)! cos 2mrdz. (6.31) 
~/0 
Also 
' : ©, (2n—2)!q*" sin?’ 
1—¢ sin? r)!=1— > 6.32) 
( : nai 2°" 'n!(n—1)! 
Inserting (6.32) into (6.31), we find 
, 1 = (@a-—2)Ie* (et? . 
E,,.=— » » sin’™ cos 2mrdz. (6.33) 
- eM 4=1 2°"~*n!(n—1)! Jo ne 
On inserting (6.24) into this, we obtain 
Le) 
g (—1)™"*' = (2n)! (2n—2)! gq?" 
E m >» Sin—-1,,1 ' : , (6.34) 
m iaomo 'n! (n—1)! (n+m)!(n—m)! 
Thus £,,, is of order q’" or of order k” and 
. , 
g tf 
I a ta5+ 6.35) 
q 
E —_ i" 6.36) 
gs 956 ) ) 
Then, for our purposes 
2 4 ‘ 
, - gq tf ° ¢ . 
E (¥,q) = (2/n) E(q)¥+( r +2, ) sin 2y- a sin 4y¥+ . 6.37) 
ona aJIV 
Finally, inserting (6.28) and (6.37) into (6.10) and (6.12), we have for the »7-integrals 
N, and N,: 
. , ' l a ge . 
Ni =(aj—ag) in| Biv ( l+5 7 ) sin 2y + esi 4y+ ...], 6.38) 
~ » 
7 3q* 
N,= (08 — a4) ino Bav—% (4+-3 4") sin 2y +5= 6 sin4y+ .. | 6.39) 
> aJI0 
with 
2q 2 - ’ l 3 15 
> f a" 2 , 
B, - [K(q) Eq) ate [x 2 6.40) 
9 1 9 
B,=- K(q)=1- 24 — 
mae q 


pO tee me 
Here the terms in y are exact. 


In N, the sines are correct to O(k*), while in N, they are correct 
only to O(k); this is as much accuracy as we need for N,, however, since it is multiplied by 
c?=kp* in the first kinetic equation (2.5) 


6.2. The Integral N, 
From (6.3) and (3.34) we have 


” 
(ag—ai)!N;= 4 | (1—n?)~!(1— n?/n5) ~2(1 — 9?/n3) Ady 6.42) 
Then since 
= (2m)! 
(1 2/y2)-§—S* -| (6.43) 
oh is mao 2 "(m!)? WM: 
we obtain 
. , Gaia ™ 
(ag—az)iN : - —— . (6.44) 
°  £% ~22™"(m!)? 
where 
*” 
Ln (1—n*)~'(1—n?/n5) ~'9?"dn. (6.45) 





With 


L (1 n-) ] n?/n ‘dn 6.46) 
and F 
m—1 
(1—n?)~*9?” = (1—9?)-'— D5 9”, (m2 1), 6.47) 
we then find 
m—1 
§ Lo— DS Lan, m=1), 6.48) 
where 
L n*"(1—n?*/ni “hn. 6.49) 


To evaluate Lo, rewrite (6.46) as 
, . »\-3 ae 
L n °(n l (n no?) “dn (6.50) 


and introduce the new variable x, defined by the equation 
tan X l1—n-)* tany cos J tany 6.51) 


and the requirement that x and y shall keep in step. I.e., whenever y equals a multiple of 


r/2, xX shall be equal toy. Then 


Ni é 


esc’ yp l cos? J cot? x (6.52) 
and 
9” *=9, “cac’ ¥— 1] cot? J esc? x, 6.53) 
so that 
n dy cot? J esc? x cot x dx 6.54) 
and cot xdx=0 according as dn=0 in (6.50). With n=» sin Wits integrand becomes 
tan J! cot xX!) tan x dx tan J dx, (6.55) 
so that 
L tan J) x=n,(1 n *x. 6.56) 


To fit the angle xX into one’s knowledge of the corresponding elliptic motion, note that us Cc 


approaches zero, 7 approaches sin @ Then, by 6.53 


tang . oe 
x ‘Sin ( ) Qo q). b.04) 


tan / 


where @ is the right ascension of the node. 


To evaluate the integrals 1,,, put n= sin ¥ in (6.49 Then 
L "5 sin?" r dr, (6.58) 
so that 
I nov 6.59) 
To handle the cases n= 1, we rewrite (6.22) as 
) ! » ' 
= Lali ] (27h) — 
sin*” J = The | 2 = . cos 271 (6.00) 
27*(n!) 1 (b-pp) An }) 
thus finding 
n?” '(2n) by , ” wy | 1)’(2n)! sin 27y — 
Da. + 2-2 —y) (n= 1) (6.61) 


22*(n!)? r (n+y)\(n—y)y 
Insertion of (6.48), (6.59), and (6.61) into (6.44) then vields 
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2. (2m)!n>>" >. (2m)!n>™™ (2m)!n n=l yeRtl(2n)! 
2 > r ‘ +e ~ -Ne - : . ~ 
(ay —a5)' Ns Lo ad 2M 1)2 —noW¥ >> 2m 1\2 —y >> D2" '\2 > > 92 ' 
m=0 «< (m!)* mai « \(m!)* m=2 « \(m!)* 2 o (ni) 
(2m)! 72" ™—! 1)?(2n)! sin 27y¥ 
cg hg rey OA ae Bed > “se. 
m=2 «£ \m7:)* | (n } } D+) 
If in (6.62) we now use (6.56) and the relation 
») 
. (277 )1n 
(l—n;? p> k=» 
22m ( m!)? 
we find 
(as a3)'N3=nol (1—n?)-3(1 No“ x+ Buy > > B,, sin 2sy}, 
where 
B,=1 l—n; > Yn? 
1 2m ! ~— Dn 'n 
»? m!' — 2 n' 
and 
x 9 ! ») ’ ),,/ 
‘ (2m)!n ‘ | 2n)! sin 271 
\p 9.’ ’ 9 , . J¥ 
2, Bay sin 28'f=— 20 sear he > 
s’ =] m=2 = (+h l 1 nh J nH J J 


The easiest way to isolate the coefficients B,, is to use the orthogonality of the 


sin 2sy. Then 


(6.62 


6.05 


6 64 


6.060 


8) 67 


functions 


> ' 9) » ' 
Zm)!n - i 2 ] Jn 
B — >) - —— >)— , ; 6.68 
mast] 2 '(m aos n } s)'s 
(no 64 
so that By, =O(ny*)=O(h), By=O(ny*)=OF), ete. Thus, to obtain the sine terms of 
to O(k*), we need only 
B NN. 6.70 
4 
To test the convergence of the series >} y,,.»7?", note first that 
(2n)! 1-3-5 2n— I] l - 
: ; hi 4 
27*(n!)? 2-4-6 2n 2 
and hence that 
1/9 l= 
<0 2n)'n n ion 
, ) ' > Oe 
“ 2?"(n 2 
for any orbit, polar or nonpolar. Then, by 6.66), (6.71), and (6.72 
( m l ) ) . a 
5 2 ~ Mi hi y ) 
. Ss 2 16 . 
Thus y,,.< m and 
‘ _- 
>> YmN2 > mn 6.74 
But 
x ? 19 
om N2~ > n ~ ” _ 
3 myz? — ' 63 
2 i, *)* l—n 
where ny?=O(k). Thus the series 53) y,7°" converges rapidly. 
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To test the convergence of the Fourier series 5**, Bs, sin 2s¥, note that 


9 ' ) ' 
. . . ati ” . afi — 
> B sin 2sy <S B S S H.40 
ya nuns had 92 ' heed «DOF ' ' 
1 é hi e & fi ‘ fi 
But 
In)! Den) Gen fan? dy)! 7 F | } 9 n g | ny)! 
2°" (7 s)'(7 gs)! 22"(n!)2(0 "(7 : = n! n+sya7 . lL)(n+s—2 D2n(n! 
by 4 Ther 
Dy)! 
» & vs sii 
——/ <)> ) "> ' . 
21114 
" > ~ fi eal, 'n ~~ 
~~ 
Zu B Sill 2sy > <a iin 
by 6.7 aval Ther »\ 6.75 
- = 
>) Bs, sin 2.4 6.77 
Since 7 O(k), the Fourier series converges rapidly. 

Thus all terms in (6.64) remain finite for all 3 <1, except the term involving x, which appar- 
ently may become infinite for polar orbits Note, however, that NV, occurs only in eq (2.7 
for the right ascension, when it has a factor a3, which vanishes for a polar orbit To see what 
happens in this case we must investigate the limit of as.N as 7 | For a pola orbit we have 

a \ O(a a ij l n ” x ia 6.7 
But " ¥ ry % a to fla ind by LS mad +2 
¥e a | r | n Sin | r n to fla 6.79 
hus to fsla 
aeg.\ SZN a3) X polar orbit 6.80 

If we now use (6.51) to plot x versus y for various values of 73, we obtain figure 1. 

Figure | Plot « x y fx 
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The general form of these curves can be checked, as follows. From (6.51) 
tan X=e tan y, €=(1—n}?)!. (6.81) 


To find what happens to a polar orbit we must let ¢ approach zero. 
If we put 


y= (n+ 4)x+Ay, X= (n+ 3)r+-Ax (n=0,1,2.3, ...), 
then 


tan Ay=e' tan Ay. 


If we keep Ay fixed and let «—0, then if 0 Ay< 7/2, it follows that Ay->r/2. If = Ay<0, 


then Ay->—7/2. In a polar orbit, whenever the satellite passes over a pole, Y=(n+4)z, 
with ¥>0, so that x thus jumps by +7. By (6.80) and (2.7) the right ascension @ then jumps 
by +7 in a direct orbit or by —7 in a retrograde orbit. These are expected results, which 
had to be obtained as a partial check of eq (6.64) for VN 


7. Mean Motions 


The purposes of this section are to obtain expressions for the mean motions correct through 
the first order, to find if we are on the right track, and to obtain exact expressions for the mean 
frequencies for later use in section 8 in checking the secular terms in the final solution 

If p,, P,», and ps, are the generalized momenta conjugate to p, 7, and ¢, then the action 


variables 


N= palo 2| ‘pele, (7.1 
J. q p»dn 4 pln, (7.2) 
Jo p pedo Polo=2rasg, 7.3) 
are functions of the Jacobi constants a, a, a3. (Since these J’s occur only in this section 


there is no danger of confusion with the coefficients of the zonal harmonics in the expansion 
of the potential.) The a’s are then functions of these J’s and the mean frequencies [8] are 


given by 


mean p-frequency = v; =0a,/d-J;, 7.4) 
mean n-frequency VY. =O0a;/Od>, (7.5) 
mean ¢-frequency = v3= 0a,/OJ; (7.6) 


Note that », and »; are identical with the usual nodal and sidereal frequencies, but that », is 
somewhat different from the usual anomalistic frequency. 
To compute these frequencies, one may use the system of equations 


wh Oc Od m ce; e 99 
- Oln, payed}. 
m=1 Od Oa, Oa, 


~ 
t 
~I 
~I 


With the use of (7.4) through (7.6) and the abbreviation 


: = Od m Oa,,, (7.8) 
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these equations become 


with the solution 


With the use of eqs 13). (53). and (55) of 1] and sections (5) an 6) of the present paper 


we find 


Then 


2m ( 3) ing WAL BS! lat+b,4+-Ai+errzA. BB 7.23 

The above results are all exact, for the potential (2.3), and we shall refer to them again in 
section S 

For con parison with other theories it is desirable to express these mean irequencies as 
power series in the oblateness parametet For this purpose it is convenient to use the orbital 
elements lg, ¢ and i, and since we shal earry the series only through the first power we may 
replace 4 ( p by j ( p 

With use of the relations in sections 3 and 4, the above equations then become 


Ir } | 4 5, cos Oil 7 95 
where 7 s given by 
rv a i .26 


Similarly 


J ri » é Cos 7 Oke 4.26 


mS °r sen a rh r cos Oh 7.28 
which together with (7.14), (7.24), and (7.25) lead to 
2rv3/! sgl a l *h 5 cos ] ; ki cos Oh 7.29 
Here SUTL a ] according! as the orbit is direct o1 retrograde, respectively 


To avoid anv use of the con ept of an osculating ellipse, we may define the mean motions 
as follows We sav that the ascending node exists onlv when the satellite is over the equa- 


tor, travelling north Let q) be its right ascension at such a tune We then define the mean 
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motion Q of the node relative to OX by 


Q=lim ———_, 7.30) 
tro t,—t 

where Q and ft are the values of 2 and ¢ at some (ascending) node and Q, and ¢, are their values 

i nodes later. Since the present system is of the conditionally periodic Staeckel type, it 
follows, after some fairly close reasoning that I shall here omit, that 

Q=2r( v, Vo) SZN as. 228) 

We also say that p-perigee exists when, and only when, p=p, and we let ® be its right as- 

A p : . fh : 
cension at such a time. Then the mean motion of p-perigee relative to OX must be equal to 


the mean motion ® of its equatorial projection relative to OY and 


By : i) ®,, - 
&=lim 7.32 
ae te 
Here # and ¢, are the values of @ and ¢ at some p-perigee and %, and ¢; are their values 7 
p-perigees later. Then, again omitting the proof, we have 


b—2r(y v;) SEN as, 6.30 

Now let w be the are on the celestial sphere from an ascending node (when it exists) to 

the next p-perigee. Then, since the mean relative motion of these points must be equal to 
the mean relative motion of their projections in the equatorial plane, we find that the mean 


motion w of p-perigee relative to the node is given by 


w= (@—Q) sgn az, 7.34 


2r(v2—vr), 7.35 
with use of (7.31) and (7.33), for any orbit, direct or retrograde. 


From (7.31), (7.25), and (7.29) it follows that 


3kn Cos ‘a Ok?) 7 26) 


and from (7.35), (7.24), and (7.25) that 


w kn (5 cos? 4+ ] O(k?) 1.37 


Here no/2x is the frequency in an elliptic orbit with the same total energy. Equation (7.36 


agrees with results found by many other authors, as does (7.37) when w is the mean motion 
of r-perigee relative to the node. By (2.1) and (2.2), however, r?=p?+kp® (1—7’), so that 7 
and p differ by a variable quantity of O(k). It is thus a little surprising that the mean mo- 
tions of p-perigee and of r-perigee relative to the node should be equal through O(k This 


means that the mean p-frequency is equal to the mean r-frequency to this order. 


8. Solution of the Kinetic Equations 


Before solving the kinetic equations (2.5) through (2.7) it is convenient to have several 


relations connecting the uniformising variables KE and v. From (5.12) we obtain 
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COs kk é | 
Cos Tr . s\ la? 
] é COs kk 
The requirements that dv/dt>0, dE/dt>0 for all t lead to the result that dv/dé 0 for all ¢ 
Because of this result S.1a) leads to 


1—e*?)isin # 


sin 7 7 S.1b) 
| & COS k 
without ambiguity in sign. For a given value of /, eqs (8.la,b) determine modulo 27 
On LN posing the further requirement that v shall always equal IZ whenever the latter is a 
multiple of x, we find that # determines 7 completely Two other relations are often useful, 
VIZ, 
, |-+-¢ hk 
ans € ) tan 5) Sle) 
and 
hk ysin # ' 
tan - - ’ r l l—e* l 8.1d) 
2 l—y cos k 
Before beginning the solution of the kinetic equations it 1s desirable to assemble the 
results already obtained By 2.5), (2.6), (5.1), (5.2), (5.30) through (5.33 5.6 5.28 
5.35) through (5.40). (6.1). (6.2 6.6), and (6.38) through (6.41 


the equations for rh) and 7 are 


t+ 2a b,F+a(k—e sin E Ayw+- A), sin A,» sin 2% 


l . 
c?(ajs—a n | By 2-+-q*) sin 2¥-+4 q sin wy | +periodic terms of O(/ 8.2) 
\ 64 | 
B»/a 2a A.rt+ Ay sin 0+A sin 27+ Aos sin 37+ Ag, sin 47] 
aa | BF z i 1g?) sin 2y-4 se sin wv periodic terms of O(/ 8.3) 
Here 
f a(l ecos k atl ( le cosv 8.4) 
n Mm Sin vy, 8.5 
with # and v connected by any of the eqs S.1 
A ] e- p _ h Pp np hth Re e-)3 Ok . S.6) 
| ] é p > h p "pP h h PR ] é Ot 8.7) 
| ; | ( p ( Dh h p h Ot SS 
| = l U p hie Oh % QO) 
B ~ K q k q Of S10) 
md 
B,== K(q)=O(k 8.11) 
T= No/ N: Oh 8.12) 


K q and k q) are the complete elliptic integrals of the first and second kinds 
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n= (1—e*)'ep-*| —D Ap*+(2.42—B) p+? ABp (145)+2B24+30) |-00 3 
Ay, =(1—eé*)lep 7 P+ ’ 3) P+ - Bp (A+-7 )+-5B4(4+ 3) =((h), (8.15) 
An=(1—e)tp-*} (347-28) p24". ABetp +2 BY 3241) ou (8.14) 
” 8 2° /4 16° S 2 ‘4 
. FABe? 22,2 
Ay,=(1—e)'p~*e se P “| Ok). (8.15) 
Ag, — (1—e?)'p*B*e*=O(k*). 8.16 
296 
In the above equations c?=r2J, is regarded as known, as are the orbital elements a, e¢, 


No; B,, and Bo. Then P a(1—e*) and Ne, A, B, a, Ms, and Q@ are given by section 4. Also 
l . ; 
b,= —=A, B=B, R,(x)=2"P,,(1/r), with |z!<1. 
To solve (8.2) and (8.3), place 
E=E,+E,, v=0,+0,, v=v.+¥ a s0 


where the subscript s means “secular’”’ 
N, cycles in time 7; and if 7 goes thro 


r 


v=, 

Since we have already obtained exact 

can obtain the secular terms exactly. 
By (8.18) we can write 


and the subscript Pp “periodic.” Then if p goes through 


ugh N, cycles in time 7), 


— 2nN 
k p lim 7 2m S.1S 
0 
9 
lim 7 -==29y.. 8.19 


expressions 1n section 7 for and vy, it is clear that we 


We shall also obtain the periodic terms through Oh 


E,=v,=M,, (8.20 
where VM, will play the role of the secular part of a mean anomaly. 
Then 
E=M,+E,, v= M y=¥,+y 8.21 
We may obtain the secular solution of (8.2) and (8.3) independently of section 7, by 
dropping all the sines in those equations, placing M, and y=y,, and solving the resulting 
equations for M/, and y,. The resulting equations are 
{ 2a ) 2(a- b, t A,;)M c? (ae a n By t re) 8.22 
(—2a,)-! A,M a5—ag)~?n Ba B»/a 8.23 
with the following solution. 
8.1. Secular Solution 
vi 9 : B, (t+, C* Bo a n.B S24 
4 , a) }* : >? ~ 
(a- b A } B, C ne Ay B 
y ' , A, (t +8) + Boaz! (a bh A ” 
(a, — ag)? No ) ) a 
- ’ (a+b,+A Bo+e ne AB 
Comparison of these results with (7.22) and (7.23) shows that /,=2ry, and ¥,=2m%, as ex 
pected. We may now rewrite these expressions more conveniently as 
M 2rv; (t+; c* B MH “Ni B B; 8.26 
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y¥ 2rv,|t+-B Braz! (a b,+A As *} S.24) 
As a check, note that to O(k®), 2m 2rv.—No, so that 
V/ nol(t D e I 
nap 
y n,(t+8,)-+4 1 /¢)(] 
asi 
But, to order kk’, a u and A 1 —¢ r!' so that npeal(a.A Nod pi up (l—e Rs 
/ 2 / 2<49 PLB, 


since nad ul to Of 


Thus 
Ok 


a 
— 
~ 
oi 


as is to be expected, with 6, replacing « 
As a later aid in reducing the solution for thre periodic terms to Kepler's equation, 10 18 
22), by transposition of some of its terms: 


here convenient to rewrite (S aé 


f Oo ya ad h V/ 2a A \V/ "(ae a ” By S.28) 


8.2. Periodic Terms 


We shall pul successively 


E,=] y—¥ Step 0) 
/ ij I: Y,=Wt+y¥ Step 1) 
I: E,t+khit+k y,=Wtwn+Y Step 2) 


In step VU, we retain in the equations (S 2) and (8.3 only those period terms which are 
of (O] VIZ, Sil / In step 1, we retain 1n these equations all periodic terms ol ordet h OI ke 
In step 2 we retain in the equations all period. terms of order k 


but none ol higher ordet 
kL. or k?, but none of higher order In carrving out each step, bowever, we shall suppose that 


each quantity involved Is calculated to such an accuracy that the error is of ordet k- Then, 

effectively, & and Wo will all contain terms of order k& and &, as well as terms of order f 

I: and Ww, W ll contain no terms of order j but will contain terms ol order & and } k,. 
and y, will be of order / Such a procedure will greatly simplify the resulting equations. 


8.3. The Periodic Contributions E,, v), and y¥ 


On placing / \/ he \/ and yv y YY, In (8.2 and (8.3) and retaining only 
| ¥ 


the term sin / of the periodic terms, we find 


3 /a 2a LCV , , n B(v.+¥ 8.30 
On subtracting (8.28) from (8.29) and dividing the resulting equation by (a+-6 2a 
we find 
V+ &k e’ sin (V,+E V/ S.31) 
where 
ae 
‘ 8.32) 
ad h 


since b } Equation 8.31 is Kepler's equation for \/ k,, with an effective eccentricity 
Let us suppose it to be solved bv the most appropriate method, which will depend on the 


We then have \/ fey and ean then find V/ | by use of eqs (S.1 
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value ol ¢ ‘ 








On subtracting (8.23) from (8.30) we then obtain Wp as a function of v 
y <a) *(aj—ag)*n ‘ALB; ly 8.33) 


Here the coefficient Wo/? is unity to O(k°), but to follow the procedure outlined above we must 
not make such an approximation. Instead we must calculate it so accurately that the error is 


of order I 
8.4. The Periodic Contributions £), v;, and y 


Now, knowing M,, ¥,, Ey, v, and po, we place E=M,+F,+F,, v= M,+-%m+%, and ¥ 


¥ ¥ ¥; Into eqs (8.2 and (8.3), discarding only the periodic terms of order k?. Then 


t+, 2a) (a+6,)(M,4+k,+k ae sin (M,+k,+E AM 
C*(a a Ul} B Y¥ y , Sin 2y 2y 8.54 
Bp a { 2a ) A (\/ Vet 01) Ag Sith V/ r ! As sin (2\f 20 
5 — 0G nf B ¥,t+W+¥ * sin (2y,+2y ] 8.35 
Subtraction of (8.28) from (8.34) and division of the result by a+h 2a now ives a 


Kepler equation for /7,+-4,+E;: 
M,+hE,+ki—e’ sin (M,+k,+ FE M,+M,. 8.36 


where 


Here we have used (8.33) to combine terms in v) and ¥ The quantity V/ V/, is then a mean 
anomaly whose secular part is exact and whose periodic part is correct through order kh. It 
has no periodic part of order k°: this is characteristic of a mean anomaly, 


[t is not necessary lo solve the Kepler equation all over again If in (8.36) we put 
sin (M,+-k,+Fk (1—4F?) sin (M,+-E Ek, cos (M,+E OWE), 8.38 


the error is of order *. Then (8.36) and (8.38) vield a quadratic equation for £;, whose so- 


lution through terms of O(/?) is given by 


V/ e’ VW? sin (M,4+- E. 
| e’ cos \/ kk 2 l e’ cos \V/ Ie 


To find », insert : M Yotv, and k V/ I iy into eqs (S.1 and solve for 2 
On subtracting (8.23) from (8.35) and eliminating terms in ¢) and Yo by use of (8.33), we 


then find 


¥ Za a a ” B Ay A Sih \/ A Sith 2M Zt 8.40 
q > ; J 
t By* sin (2y 2y 
S 


(Note that the elimination of ¢ and y¥ would not have been possible if they had been carried only 


through order 4 


8.5. The Periodic Contributions E,, v,, and y, 


Finally, knowing M,, ¥., Eo, v%, wo. Ly, 1, and y, we place H=.M,+- F,+-h\+-fky, M 
, and y=y y ¥ y in (8.2) and (8.3), discarding only the periodic terms of 


order greater than 4 The equations become 


194 

















(M,-+ 


t+-8 2a a+b6,)(M,+E,+4i+F, ae sin 
| sin M re | sin »M 27 CA» a ” |B y¥ y y¥ 
l 9 gq’) sin (2y 2y,+-2y,)-4 a sin (4y,+-4y , S.41 
gr? Pe Se aes ae 
B»/a 2a 1,(.\7,+-1 1., sin (M,+0)+2 1.. sin (244,42 2 
1 Sin ,. »/ A, Sin t.\/ t a x ”n | B y¥ y y y¥ 
Y »/ »/ / ‘ / / ) 
5) t-+-3¢q*) sin (2y 2y 2y,)- 2565 q’ sin (4y¥ ty 8.42 
On subtracting (8.34) from (8.41) and discarding periodic terms of order & or higher, we find 
0) 2a a+b,)k,—aek, cos (M k EB Ay A;, sin (VM 
: | 
Ay. sin (2M 2) C*(a;—a n Biy ;¥ cos (2y 2y 
fT sin (2¥,+2¥,) +4: sin (4y,+-4¥ ] 8.43) 
S > 
The 
, V/ 
hk: " s.44 
e’ cos (M I: OF 
where 
\V/ 1+b E 1,,sinc.V/ 1,. sin (2.\4,+2 
a | i ' os ¢ ' ' q ' ' 
( 2a a : n By 5 M1 COS 2y Jy sin (2y 2y , jst ty ty S.4 





how that. to ordet a 


is easv tos 


VU,t+kE,+Ek E,.—e’s V.+kh,+Fk+Fk Vo+-M V/ 8.46 
so that VJ is the second-order periods term of a total mean anomaly 
M—M,+M,4+M 8.47 
correspol dit Yr to the effective eccentricity ¢ 
To find insert \/ and I: \/ I: I: I: Into eqs 8.1 aie st 
ol 
To find y sub vt 8.35) from (8.42) and diseard period terms of order / u 
The result is 
y Za a x y B | | COS VJ 2A COS 2M 2 A S iM 
: q , P Os 7 ov 4 eT ai / q sin J »/ ] s s 
A.,sin (4.1 / +4 B ly ( Py, +-2y bg Sin (2Y. + 2¥ 64a! ty ty 1 


This comple tes the solution, with exact secular terms and periodic terms correct throug! 
order 4 for k and y and thus for the spheroidal coordinates p and 7 
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8.6. The Right Ascension ¢ 


From (2.7 5.3), and (6.3) we obtain 


o=—£6 aa.N Cash (8.49) 


where x is an angle that always equals yy whenever y is a multiple of w/2 and which also satisfies 


tan X 1—n;)? tan y 
The expressions for A. and the As,’s are given in (5.61) to (5.65) and that for B, is given in 
(6.65 With the secular parts of » and y exact and their periodic terms correct through order 


kh’, the right ascension @, as viven by (8.50), has a secular part that is exact and a periodic part 


correct through order i To check the secular part ol o, note that one can obtam it from 
8.50) by placing x ¥ ¥ and » r and discarding the snes If we do so and also use 2rv, 
and ¥,=2mvm, we find 
Dp 2re*a,( 2a A v 2ra,s( as a *” B ] 7 ” J l J ‘ 1) 
on comparison with (7.18) ane 7.2] Thus we find 
@ @o Ir Q 5) 


by (8.51) and (7.14), a result known to be correct [8 


A summary of the principal results of the papel follows in section 9. 


9. Summary of the Solution 


We assume that w and ¢ are known, where u is the product of the gravitational constant and 


the IDwass of the planet and where c ; J . being the « quatorial radius and m the coefficient 
of the second zonal harmonic of the planet’s cravitational potential For the earth J, 
(1.08S)10 


If V, ¥, Z are the usual rectangular coordinates of an artificial satellite and if r, 6, ¢ are 
respectively Its planetocentric distance, declination aie rig'it ascension, its oblate spheroidal 


coordinates pon, Oare Given by 
\ tg rcos 6exp+@ p ( | n CXP 19, 


Z=r sin 6= pn, Sn! 
The potential 


then fits the even zonal harmonics exactly through the second and, in the ease of the earth, 


approximately through the fourth Solution for the motion with such a potential thus furnishes 


a very accurate intermediary orbit. Since this potential leads to separability of the Hamilton- 
Jacobi equation, the solution is given implicitly by the quadratures of eqs (2.5) through (2. 


~] 


The Integration constants are the Jacobi a’s and p's 
If the initial conditions are known, one can readily evaluate the a’s Then if one can 


evaluate the integrals in (2.5) through (2.7) one can also evaluate the 6’s Evaluating the 
integrals dep ‘nds on factoring the quartics F(p and G(» The factoring of G(n) is immediate, 
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since it is quadratic in 7 To discuss the factoring of F(p) we introduce, in place of the a’s, 


the orbital elements a p/2a,, € | + 2a,a3/u"}?, and 7 cos Qs / Ay 
With p varying in the range p;S pS p2, we write 
k 2.2 21. a2 2-19 ») 2 ») 2 1 / > 
p Ca p ( a ~Mp -%)p eae: p p p p p sip ) 
and find that A, B, a, P, and e occur in the p-integrals 
Here 
l . tp p . 2p p p p 
ad p po a ’ p a l ‘ ’ ( é 
2 p p p p P» p 
Thus A, B, p p and Pp); p. are needed I! we put 
p ay(l ¢ j cp M l ( 7, Qs a cos } 
then through O(/ 
| ko poy?|1+-ko(2a ry $-+-Sy 
B=kypeAi—y?)(1+hky(4y?— zy 
2a=p p 2 Pal 1—kywr?y kee r2y? (2a ie) | 1+Sy 
ap=pip Pos 1+A yr (a } key?( 12a J 20r°y 16 2 Y- rp 
If ao. « , Di, Bo, Os are used as orbital elements, we then assume that the quarti fk p) 18 tas 


tored numerically, to as great an accuracy as may be desired, with the aid of the above 


as a starting point 


With » varying in the range—1I: MLNS mL1, we write G(y , 1 —*)(a5+ 2ayc*s 
Panc2(n2—n?)(n n*) and find 
” £ Jayt r So; ¢ ct a 3 
( }== : 
n é a \ se (re 
Here »>1. Then all the quantities a, p, ¢, A, B, m, and m are known in terms of the orbital 


elements a i and 

If we assume that the orbital elements are to be determined bv an iterated least square 
fitting of the solution to many revolutions in the orbit and not by initial conditions there 
better set of elements, introduced by Izsak [4] These, viz, a, e, J=sin™'n, 8 %, ands,, al- 
though not easily found from the initial conditions, result in unmediate factoring of the quartics 
k p and G(n We therefore give the solution in terms of these quantities, with the under 
that thev are to be determined eithe by the least-square fitting or from initial 


standing 
ditions by numerical factoring of F(» 


Criven xu, ¢ and the elements a. ¢ / D D Dea, 


compute 


m=sin I, 
p=a(l—e?), D=(ap—e) (ap—e'n?) +4a%en D’ =D+4a*e(1—n 


A 2acD | nN) (ap Cn B en2D D 








Restrict considerations to the case 6,/b.51. Then 
I.s 1=180°—I,, 


where J,=1°54’, approximately. Equatorial and almost equatorial orbits are thus ruled out 
Then com pute 


A (] e*)'p>3 (be p)"P, h h. R. ol (1 e* 


Az (1 e*)*p 3 (b, p)"P,(b bh. R ] e*)? 


where P. r) is the Legendre polynomial of degree hi and where Ra t P, J always a 


poly nomial of degree |n 2] mx 
A (1 e* *p > *D R a (1 é 
where 


D » > i” "1a p)* it bh. p 22P, (b,/b 


= 
dD y > ] "le p : 2" bh. p) P 4 h 
B,=22-'q°?|K(q)—E(q)|, B,=2x-'K(q), 


where A(q) and E(q) are the complete elliptic integrals of the first and second kinds/respectively 


‘ 2m ! I (2n 'n2’ 
B,=1—(1—n;’ SS ym ; where S ‘ 
2 — 2 2 m' — 92" 1)2 
The above SeTies all converge rapidly 
A ; | ‘ pP | 2h hep hs . A D p ] e? bie . 


A. | r mg [ p 3h: b2) p : = b h ( 1+ c ) P 4 oa } 2 2 p ‘| 
5 
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cn? A,B, B 


n A,B, a+b A Cn A,B, B 
M | l V/ , and 


then given by E 


The uniformising variables /, v, and wy are 
¥ y¥ If ¢ is the time, their secular parts 7, and y, are given exactly by 
V/ 2rv;(t+Bi—c* Bra, ‘7 BB 
y¥ 2rv,\t bp DQ, a b A A; 

Let the periodic parts be split as follows: k ke ke E,, F and 
v,—¥ yo, where, e.g., /) contains terms of order k°, k, and k*, £, contains terms of order 
k and k*, and £, contains only terms of order & 

Then &, is given by the Kepler equation 

U,+E,—e’ sin (M,+E V/ 
where « a(de b ( The term is then given by placing d V/ ry, and & V/ / in 
the anon aly connechons 
= l—e cos E sin F 


or equivalent relations 


equation 


COS COS iD ‘ ] é COS iD Sill 
here is the original e and not the e’ in the Keple1 


Note that the ¢ 


The Nn 
9 


i 
b 


is now given by 


term £ 
I: 1—e’ cos (M 


where 
\/ a } A < A B B ; 2a a y n, Sl 2y Jy 
The term is then given by placing V/ and & \/ kk i, in the anomaly connec 
tio? Then 
y 2a a r B | | Sill V/ A sil 2M 2 T iB Sin 2y 2y 


The spheroidal coordinates p and mn ure the 1 


is found by placing 


His and 


rivel by 


sin w, where 


a | é COS I | € COS p 


E=M,+£,+£ 


j 


Fk, M+ tet, y=v.+¥ 








The right ascension is 
ra) re} Oe (OG ag) *n [ l No) l N»~) 7 B y + = NH Ne ‘sin 2» | 
22 ”) 


‘ 
Ca ~a i v+ >> A sin ne | 


Here X is an angle that equals y whenever y¥ is a multiple of 2/2 and which also satisfies 


tan X=(1—né)' tan y. If ag is positive or negative, the orbit is respectively direct or retrograde 
The above solution gives secular terms of the intermediary orbit exactly and periodic 


terms correctly through order k*. There are no long-period terms 


10. List of Symbols 


We here list only the svmbols that are most frequently used, civing for each a short defini- 
tion or the number of the equation in which it first appears. Note that the first digit in each 


such number is the number of the section 


Symbol Definitic r equatic whe Symbol De 
a—3.23; element of 2d set Pe vetion variables; (7.1) through (7.3 
a 3.3; element of Ist set Se 7.8 
13.11 (also 4.12), } 3.16; c?/ po, 
| 5.31 j 1.17: ¢/p 
I 5.36 K\q 6.15; complete ellipti tegral of first kind 
1 5.32 modulus q 
| 5.33 | 6.46 
l 5.37 through 5.40 for j me LL. 6.45 
1.5.61 / 6.49 
\ 5.62 through 5.65 for 1, 2, 3, 4 V,—8.20; secular part of ‘mean anomaly 
hb 5.6 VM 8.37, 8.45; periodic parts of “‘mean anor 
$3.11 (also 4.13 ily of orders k and | 
B,—6.40 N,.NeN 6.1 through 6.3: n-integ 
; 6.41 l 
B,— 6.65 p | 
B 6.64 (also 6.68 P Legendre polynomial of d 
e—2.4 (fundamental distance in potential and qn” 
in definitions of coordinates 2.1 and 2.2; geocentric distal 
d 5.46 equatorial radius 
dD, 5.44 (also 5.50 and 5.53 3.1, 3.2: zeros of flp 
3.2 element of 2d set RRR 5.1 through 5.3: p-integrals 
3.4: element of Ist set R (2 z°Piilz 
i “li a ign of @ 
BF 5.12; “eccentric anomaly,” /—time 
Rk 8.17; secular part of EF V 5.12 true anomaly 
BE 8.17; periodic part of 2 8.17; secular part of V 
ky, Fy, ks—various terms of E 8.17; periodic part of 
q complete elliptic integral of second kind Various terms of 
modulus q, | 2.3; the potential of this paper 
Eiy.q)-—ineomplete elliptic integral of second kind X,Y,2—2.1 a 2.2; rectangular coordinates 
6.0 and 6.10, 8.17 l-—ep 
tip 2 10 / 3.18: as/a COs 
F(p 2.8 a total energy, first Jacobi const 0 for a 
Fig.q incomplete elliptic integral of the first: kind satellite orbit 
6.9 and 6.10, a second Jacobi constant 
Grin 2.90 a Z-compone nt of angular momentum 
h—5.7 third Jacobi constant 
l 3.5: element of Ist set 3,. 82.8 Jacobi 8's 
i 1.7: element of 2d set + 8.1 d, 
/.-value of J for which A= 1, ym 6.66 
J 2.4; coefficient of second zonal harmonic of 6,-—5.47 
potential, € 6.81 
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Symbol Definition or equation number S; mbol Definition or equation number 


7—2.1 and 2.2; a spheroidal coordinate —sin 6 x —6.51, 

as ro, ¥—6.4; a uniformizing variable analogous to 
no——3.33, 3.36, 3.39; during motion the argument of latitude, 

1S—m Sn Sm 21 ¥.—8.17; secular part of y, 
nz 3.38, 3.37, | ¥,-—8.17; periodic part of y, 
i] > 1 and 2.2; geocentric declination vo.¥1,¥2—Various terms of y,, 
A— 5.6: b,/be o—mean motion of p-perigee relative to the 
¥1,P 2,3 7.4 through 7.6 ascending node 

p 2.1 and 2.2: a spheroid il coordinate > 2 mean motion of the ascending node relative 

sr to OX, 

2.1 and 2.2: geocentric right ascension; the 


iird spheroidal coordinate 
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Note on the “Baffled Piston” Problem 


F. Oberhettinger* 
April 7, 1961 


King’s integral expression governing the problem of the time-harmoniec motion of a 


piston in a rigid wall is simplified 


In a previous paper [4]* the case of an arbitrarily 
corona l t(t?+-27)—4 exp [—7(# f(t)dt 2) 
not necessarily time-harmonic) moving disk in a . exp } {(tdt, (2 
rigid wall was discussed. The starting point was 
the eq | 12) in the above quoted paper, where 
‘ 0) O<—« 
a r*-+7*) *exp r++ 4) J, (rp)J,(ra)dr, l t—p 
« t- p* a- 
fit ra arccos , f { a f p 
l 2pt 
vhich, when y is replaced by ik (k=2z/X) repre- 
sents the velocity potential for the time harmonic Acain a before two casé \ und B have to bye 
; . a : . | Again, as . ases A { lave e 
‘ o’s ral [2] . stirs me | 
ise (King’s integral [2] The inve tigation Ol | ditinautshed ete: ~gyiae 
transient phenomena as carried out in [4] did not 
nvolve l directly, but its inverse Laplace trans Case A, a a 
form with respect to y, and therefore not much 
ttention Was paid to (1) itself But the computa- fa O-? . 

. " f ) 
tion of the acoustic pressure distribution for the ; 
time-harmonic case makes an investigation of (1 f(t _ arecos i+ p*—a a ta 

atu ‘ . i i f p 
tself imperative. Numerous approxunate evalua- 2pt 
tions of l with 4 replaced by ik, k>O) have 0 i>a ’ 
been carried out. (for instance, [1], [2], [8], [5], [6 
lt may be noted in this connection that (1) can be Case B, a 
onsiderably simplified in as much as that it can be 
transformed Into an integral expression with an / 0 a 
elementary integrand which is extended over a | J(é ra arccos Df p—a<t< pra 
rite interval of integration However the ex- = 
pressions (22 and (23) in [4] for the transient field 0) otherwise 
an be obtained as easily from this simplified eX- ‘ 
pression for l | Therefore, inserting f(t) into (2 
We proceed now to replace the factor ~?-9 7exp | 
r?-+-4 by the integral formula in [4(9 Case A, pa: 
exp T ~y~)3 ° 
{ f(t exp vit [ 
Ji irt)t(t . ? exp y(t $| df "a+, 
. T t(t 
pon interchanging the order of integration, one ‘ , 
o?—¢ 
»btains instead of (1 exp y(t arccos ( 4 )dt 
a t(f 2)-4 exp v(t 4] or, upon evaluating the first integral, 
' | l 
J i(tpoIo(rt)Ii(ra)dr }dt ~ exp 7 — exp Y r(a—p 
/ i 
(he inner integral is known [4(20)] and therefore, SP t(t 2)-1/2 
nstead of (1 } Ja~p 
— — ; . TP f , a 
*Oregon State Coll depart t of Mathematics (invited paper exp y(t *)*/*| arccos : dt 
’ i ~~ 
Figure rack at rature references at the end of this paper Zot 








Case B, p_-a: 


ere 
U=_" t(82-+- 2*)~°"2 
/? a 
#2 2__g? 
° e\1i/e Tr —-a 
exp|— y(t? + 2*)!/?] arccos ( = jdt 
Zpt 
Substitute ?4 co and get 
— . mage in 
l exp(— yz) exp; —y|2?+(a—p)?*|'”? s* 
Y Y JR 
at 24 5% q? ; 
exp ya) arccos > = wr da, pia, (>) 
«p(ae* py ee 
*R 
l T exp! ya 
e Rk’ 


- © 9 9 9 
a wey a* 
arceos | = ; a\i73 da, pra (4 
=pla st te 


As in [4(21)], R and Ff’ denote the smallest and the 
largest distance of a point of observation from the 
circumference of the disk respectively. These prop- 
erties are given by 


R’ 2? -+-(a—p)?|'/?. 


The expressions (3) and (4) for L’ are considerably 
simpler than [4(12)]. For the time-harmonic case, 
y has to be replaced by 7k. The expressions [4(22)] 
and [4(23)] for a ‘Dirac pulse” are immediately 
obtained from (3) and (4) by taking their Laplace 
inversions with respect to y. 
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Recently Mirsky and Farahat proposed the problem of characterizing the class of doubly 
stochastic matrices for which the least number of permutation matrices necessary to repre 


sent it as a convex sum has a prescribed value 


to the number of eigenvalues of modulus one 


matrices is also treated 


It is shown that this number can be related 


The problem of similarity of doubly stochastic 
Finally, the question of transitivity of powers of sets of functions 


on the first n positive integers into itself is treated by defining a corresponding incidence 


matrix and examining its powers 


I. Introduction 


In 1946 [1]* Birkhoff proved that any doubly 
stochastic (d.s.) matrix with non-negative entries is 
a convex combination of permutation matrices. We 
recall that a matrix is d.s. if every row and column 
sum is 1. Let Q, denote the polyhedron of non- 
negative d.s. matrices and as noted in [5] dim Q, 

(n—1)? and hence any Ae, can be written as a 
convex combination of at most (n—1)?4+ 
tion matrices. Mirsky and Farahat in a recent paper 
6] suggested an investigation of the minimum num- 
ber B(A) of permutation matrices necessary to repre- 
sent AeQ, as a convex combination. In section 2 we 
obtain an inequality relating B(A) to h(A), the num- 
ber of characteristic roots of A of absolute value 1. 
In section 2 we also study a problem of the similarity 
of two matrices in Q, 

In 3 we discuss conditions on A >0 that are implied 
by the equality det (J—A)= II (1—a,y,). 

t=1 

In 4 we obtain an inequality for the difference 
between the maximal characteristic roots of two 
matrices A and B satisfving B>A>0, where B>A 
means 6,,>a,, for all 7,7. In certain cases our ine- 
quality vields better bounds than those obtained by 
applying the results in [2] or [7] 

In 5 we discuss the question of transitivity of sets 
of functions on the set of integers {1, . . ., mn! into 
itself by matrix methods 


2. Doubly Stochastic Matrices 


A precise statement of the result relating A(A) 
and 8(.A) is contained in theorem 3. We prove two 
preliminary results of some interest in themselves 
Recall that A>O is reducible if there exists a per- 


: This work was supported in part by the U.S. Air Force Office of Scientific 
Research 
? The work of this author was supported in part by the U.S. Office of Naval 
Research 
igures In Drackets indicate the literature references @ re en wf this paper 
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1 permuta- | 


mutation matrix P such that 


Ay, 0 
PAP’ ( } 1) 
An A; 


where A,, and A» are square matrices. It is clear 
that if AeQ, then (1) implies that A,,=0. It follows 
that in case AeQ, a permutation matrix ? may be 
chosen so that PAP’ is the direct sum of irreducible 
d.s. matrices and 1-square matrices. 

THeorem 1. Jf Se, and S is irreducible, then h(S) 
is a divisor of n 


Proor. By the classical Perron-Frobenius 
theorem, more recently reproved in [8], there exists 
a permutation matrix P such that 








fo, Sy O eS 4 
(0), S ~~ 
PSP’ , (2) 
Sa 
LS, 0 0» : 


where 0, is a principal square matrix of zeros and 
h(S)=h Let r(X) (ce(Y)) denote the number of 
rows (columns) of Y. Clearly since Se, each 5, is 
d.s. and r (S,) is the sum of the elements of S, as is 
c(S;). Hence, 

riS cs cio rivu ris cC(Sea (3 


' 
Thus every S, is of the same order and A(S) 
is a divisor of n. We next obtain a relation between 


8(S) and 8(S,) where S=S S,, and 
denotes the direct sum 


THeoreM 2. If S,eQ2,,, t=1 m, n=>)\nm% 
i=l 
and S=S, Sin, then 
B(S)< >> B(S)—m+1 (4) 
t=1 
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Proor. First we prove by induction on r+s that 


if 
=a, B=Do,Q 
i=l j=l 
0< 6, <A: <6,,0< gi Sy... Ses, Dd) H=1=— DD g;, 
i=l j=l ; 
P,, Q, are permutation matrices, i=1,...,7, j=l, 
_&, then A+B is in the convex hull of no more 
than r+s—1 permutation matrices of the form 
P.+-Q; When r+s=2, then A=P,, B=Q,, and 
A+B=P A Suppose for convenience that ¢@ <¢; 
Then 


s a] 
and > 1, and 
3 | W 
Y 7] , ” ] a a 
LS ¥ >! 1 
1—@ — 19 1—6@ f7 * bf 
Thus A+ B=6,(P,+49,)+U—6,) R where Re®,, is 
the direct sum of two d.s matrices, the first in 


matrices and the second 
matrices We apply the 
that B(R)<r+s—2 


Then an 


volving r—1 permutation 
involving s permutation 
induction to R to conclude 
Hence 6(A+B)<r4+s—1. 


duction on m vields 


obvious in- 


B(S)=p(S, S.) 
3S Su BUS,,)—1 
3p S,)—m+2+ B(S l 
>> 1S,—m+1 


TH OREM 3 If SeQ, and S is irreducible. the n 


where h ACs 

Proot We first that if P and Q are 
permutation matrices then B(?SQ)=— 8(S) for any 
SeQ,. By theorem 1 let n=gh where q is an integer 
and let #& be the permutation matrix with 1 in 
positions 


observe 


mod n 


qyt, t 


Then from (2) we easily compute that 


206 


PSP’R Si see | S) 1 


and hence by theorem 2 


B(S)< >> B(S)—h+1 
: 


B(PSP’R) 


But by the Birkhoff result quoted in the introduction 
we have that 


Hence, 


BUS) <h q 1)?+1 h 


Actually the above techniques may be applied to 
vield better estimates on B(S) than that oiven by a 
direct application of (6 There are several alter- 
natives for each of the matrices S; in (7): S; is 
reducible; S, is irreducible with A(S 1; S; is irre 
ducible with A(S, i? 2.6... Oe B primitive These 
situations are covered by the following two theorems 

THEOREM 4. If SeQ, and P is such a permutation 


matrix that PSI’ T T.. where each T 
an ny -square matrix eithe irreducible o7 unth 7 | 
then 


n 


ra! ) A(T 


BUS) < S : ( 
=i Nh 
Tueorem 5. Jf SQ, and P and Q are such permu 
tation matrices that 


PSQ=S ae S 
where each S, is an n -square matrix either primit ¢ 
or with n 1. then 

aS pS } 1)? l 


We describe the procedure for obtaining the form 


8) for S: First reduce S to the direct sum of irre 
ducible or l-square d.s. matrices Each of these 
min then be reduced to the form (2) which in turn 


Is reduced 


to a direct sum of irreducible or 
The process 1s repeated until everyv Matrix 


l-square 
matrices 
which appears is either l-square or primitive. We 
illustrate this with the following example: Let SeQ, 
be given by 


0) / 
where A ( ) 0, and /, are t-square zero and 
J, O 


identity matrices respectively, and JQ, is the 
f-square matrix all of whose entries are 1/f. We 
verify directly that no coordinate subspace is held 
invariant by S and hence S is irreducible. We first 
note that S?’=J,+4, and hence the characteristic 


roots of S? are 1 and 0. Hence the characteristic 








roots of S are 1, —1, and 0 with appropriate multi- 
plicities. On the other hand, 1 1s simple by the 
Perron-Frobenius result and tr(S)=0. Hence the 
characteristic roots of S must be 1, 1, and 0 (six 
times) and A(S)=2. Thus S is already in the form 
2) and we bring S by a permutation to the form 


J,+A and by a further one-sided permutation to 


Iut-Tet-demdictte4-Iy +3 


“Thus by theorem 2, 


B(S) < B(J4) + BU BU B(J 4--] q 


A direct application of theorem 3 to J, vields 8(J,) 
10 and (9 implies that 


BOS) <11. 10 
Of course it is obvious by inspection that 6 m FP } 
and henee (9) gives B(S)<5 Note that a direct 
application of theorem 3 vields 8(S)<19 and the 
hound given by the dimension of @, is 8(S)<50 
As a matter of fret, BUS t 


Recently one ol the present authors and \ J. 
Hoffman independently proved that if SeQ then 


there exists a permutation o such that IT s > Lin’ 


It follows that the permanent of S satisfies the 
following inequality 


lo! Se since 
Bis 
S=>) al’ 
Bis 
vhere P, are permutation matrices a da,>0, 7 a 
1. we see that 
Del Ss | S 
It follows that we may apply thre estilnates of 


theorems 3, 4, 5 to obtain a lower bound for S that 
in some cases is better than 1/n’ These results are 
ai otivated by t Conye ture of \ der Waerden 14] 
concerning the minimum value of the permanent ol 
a matrix in 
We next discuss circumstances under which two 
matrices in ®, are simila If SeQ, and VS NX~'eQ 
| of course is I ils that either X or b must be d Ss 
e.g., take SJ] However if the root 1 of S is 
simple then we have the following result 
THEOREM b lf Se and S is irreducible and 
HF, 2«Q... then YX a multiple of a d.s. matriaz 
Voreover, there erists YeQ, such that YSY R 
Proor. Let J be the n-square matrix all of 
whose entries are 1, and let o,(Y) be the sum of the 
entries in the ith row of .\ Then XS=/.X implies 
that VSJ=R XJ XJ=R(XJI Each column of 
YJ is equal to the -tuple a,(X), » on X 
u(X Then Mut u(X and since 1 is 
1 simple characteristic root of S and FP (S is irre- 
lucible), uw(X) must be scalar multiple of the 


0 


vector e all of whose coordinates are 1. Thus 
a;(.X) a, i l, » and by a similar argu- 
ment we easily see that CO; stg BB, 3% # ' 
n. But then a=8 and X is a scalar multiple of 
a d.s. matrix. Now YY commutes with J and thus 
the characteristic roots of X+kJ/ are X ken Ao, 
: ; A», Where A, are the roots of X and \X, cor- 
responds to the characteristic vector ¢ Choose 
k so that A knX¥0 and X+kJ>0O,. and let 
Y a nk (X kJ). This con pletes the proof 


3. Triangular Non-Negative Matrices 


THeorem 7. Let A>O with marimal non-nega- 
ti ¢ characteristic root ys ¥ The a necessary 
and sufficient condition that there exists a permutation 
matrix P such that PAP’ is trianqula 


II (l—-a det (J,—A 1] 


Proor. We prove first that if 


then Ati {) lor Pap In this urgument and 
subsequently we use the classical res ilt that states 


that if a is the dominant non-negative characteristi 


root ot a principal submatrix of A, the a * r and 
if A is irreducible then the inequality 1s. strict. 
Hence, f(r | Pe ss det rl A 0 and the 
inequality is strict if A is irreducible To prove 
that 7(7 () we proceed by induction on Now 


differentiatu 12) we have 


where A is the principal submatrix of A obtained 
by Omitting the ith row and col inh Kor each 

the summand in (13) is non-negative by the in 
ductiol hypothesis since tf r and is not smallet 
than the dominant non negative characteristic root 
of Al Thus f’(t O and f () so that 
f(1)>0 Moreover if A were irreducible it would 


follow that f(1) >f(r) =, 2,0 
We complet the prool as follows (‘hoose a 
permutation matrix P such that PA/” has the 


a {) * 


following sequence of blo« ks down the mall diagonal 
B , +B. =5 

where each 2, is 1-square or possibly square ur 

ducible. We show that the second alternative 


is impossible and it follows that ?A/’’ is triangular. 
Clearly (11) implies that 


1 l—a WI (1—h det (J,—A det (/ B 


II det es b, 


207 








Suppose some B,=C is irreducible. Then by the | 
above argument 


he 
IT (1 —c,,) det (/,, B,), 
i=l 


and we would conclude that 


™ 


1 det (/,,—B,)< (i—a 
k= 1 


i=l 


a contradiction. This completes the proof. 
4. Inequalities for the Dominant 
Characteristic Root 


Let £\ denote the unit n-simplex. If A2 0 is 
irreducible, let a be the dominant positive root and 
let 0<reH™ be the unique characteristic vector of A 


in £” corresponding to a. Suppose R>0, R#¥0 
commutes with A. Then Ar=ar, RAr=—al?r, 
A(Rr/o(Rr))=alRer/o(Rr)) where if y=(yy Yn 
then o(y) => )i2,y,. But then the uniqueness of rs 


implies that Rr/o(Rr) =r because (Rr/o(Rr))ek™. 


THeorem 8. Let BZA be two 
irreducible n-square matrices with dominant positive 


non-negative 


roots B and a respectively. Let M=mar (by—ay), 
i,j 
m=min (by—ay) and suppose that R is a_ non- 
td . . . | 
negative matrix which commutes with either A or B | 
and POSsesses NO Zero column 8. The n 
m M 
= B a - = 
r ; 
max Max— min min 14 
j ‘ t ‘ 
Daley Doles 
t= i 
Proor. Let yk, y>0, satisfy Ay=ay and | 
let w= (By) /y;=max(By),./y,.2 8. Then we assert 
that 8—a< M/y,; for (B—A)y+ Ay= By, 
(B A)y + ay By, 
n 
Do bu dude tay. = (By),, 
k=1 
l . By 
> > be Dig) Ue ax ; Ae > B 
Yik=1 Yi 
Now yek . hence (bu Qin dU * V/, and 
B—a<M/y,<M/min y, 
k 
From the above discussion we know that Ui] Ry 
a(Ry). Consider the function 
Sra 
&,(r) ~ — rele (15) 
Oo a F193 
oS j 


minimizing the right-hand side of (15) over all 


rek™ vields 


%>min -~; 
t ‘ 
> Pit 


Thus 


B—a<M/min y,.<.M//min min 
i x f . %, r 


Ad tt 


A similar computation establishes the lower bound 
in (14). 

Note that P is any non-negative matrix commuting 
with A or B; e.g., any non-negative polynomial in A 
with no zero columns. We also remark that in prac- 
tice we compute each column sum of /?, divide it into 
the least element in the column, and take the least 
such quotient to obtain the denominator of the 
upper bound in (14 Similarly for the lower bound 


5. A Matrix Condition for Transitivity 


Let L, denote the set of integers {1,. . ., nm} and 
suppose o,, .. ., o are functions mapping L, into 
itself We define 4, to be the set of functions of 
the form o;,0,,... is 1<i,<k. We say that 
Y%, is transitive for some p if for any given s and f¢ 
in L, there exists ge%, such that o(s)=t. We define 
matrix A associated with Y, 

we define (,; as the matrix 
0 otherwise; then 


an n-square incidence 
us follows For each o, 
whose (s, ¢) entry is 1 if o,(s)=t, 


It isclear that A>0 and every row 


sum of A is 1; 1.e., A is row stochastic. We remark 
that if T, denotes the totality of non-negative row 
stochastic n-square matrices, then an easv arguinent 
is the convex hull of the set of all 


let A , t) 4 


— 


~~ 


16) thows that T, 
matrices with exactly one 1 in each row. 

In general let A correspond to o,, . .o, and B 
LO @;, . oy Dy Then the i, J entry of AB is posi- 
tive if and only if the integer 7 is mapped by some 
g, into p and p is mapped by some ¢3 into 7. That 
IS, 0304 i J Thus we have 

THEOREM 9. A necessary and sufficiert condition 
that A, be transitive is that A?> 0. 

We easily obtain two corollaries to this result 
Let Ba 0}. o be a set of 


COROLLARY | as 
into itself satisfying 


functions on L, Be mn 
nu) at least one a; has a hred point, 
(b) no propel subset oF is is left invariant (as a 
set) by every o,e%,. Then A, is transitive forp -2n 
) 

Proor. Condition (b 
matrix A is irreducible. 
tr(A)>0 and hence A(A)=1. 
and the integer 2 2 follows 


implies that the incidence 
Condition (a) implies that 
Hence A is primitive 
2n from a result of 


Holladay and Varga [3] 
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Coro.tuary 2. Let , 0} 


ewe es be a set of 
permutations of the integers {1,. . .,m L,, satisfying 
(a) there exists an integer s in L, and a pair of 

pe rmutations CO; and o; such that a.0.(8) Ss 


(b no proper subset of L. is left invariant \as a 
set) by every o.€%,, 


Cc n is odd 
Then A, 


is transitive for p>4n | 


Proor. If there exists a o,, and an integer v such 
that » then we have the conditions of corol- 
lary 1 and W, is transitive for p>2n—2 without the 
condition (c If not, let A be the incidence matrix 
and then tr(A 0, but the sum of all the second 
order principal subdeterminants of A is not zero 


On D 


4 Hence A(A) is 1 or 2. But since AeQ, we apply 
ri theorem 1 to conclude that A(A) divides n Thus 
a h(A)=1 and A is primitive. Now A? has a positive 

element on the main diagonal and h(A? 1. Thus 


AS 
applying the result in [3] 
completing the proof. 
Note that condition (c 
the following example: %, 
o.—(1432 
lary 2. 


is also irreducible and thus primitive so that 
again we have (A?)2*-?>0 
is essential as shown by 
01, o} where o; 1234 
satisfies conditions (a) and (b) of corol- 
However, %, is not transitive for any p 
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Probability Inequalities of the Tchebychetf Type 


I. Richard Savage ' 


(May 


Phirt 
tailed and m 
inequalities are presented 


iltidimensional inequalities 


can be used in handbook style without cross referencing 
Examples of uses of the inequalities, tables comparing the 
ties ind bibliographic data throug! L960 are 
I. Introduction 
1. Background 
The author originally prepared this survey and 
bibliography of probability inequalities of the 
Tchebycheff type, early in 1952, for convenient 
reference within the Statistical Engineering Lab- 
oratory of the National Bureau of Standards 
( opies ot the bibliography and ol the survey were 
riven limited circulation, within the Bureau and 
imong a few individual specialists In this area, 1n 
April and June 1952, respectively, with a view to 
revision for ultimate pul lication 
Shortly thereafter the author, in his capacity as 
Associate Editor of the Journal the Americar 


Statistical Association, learned of the preparation by 
H.d University College of Wales 
of a much more comprehensive treatme it ol inequal- 
Tchebychetf tv pe, and ave his ful 
Uhh ndertaking which resulted in a 


Godwin. 


SWanse: 


ities oO the 


support to 


definitive publication on the subject [Godwin (1955)] 
\s the vears hay passec, however, Many persons 
vho have had aec ss to the present author’s original 
surve have iound it to be a tar more convenient 
sour ol cirectly applicable informat mm lor prac 
cal ipplication than Godwin’s more compre hensive 
papel ( onsequently in response to the urgings of 
urious prole ssional colle ivves, the author has 
Orougnt togethe an combined in the present papel 
his 1952 survey and bibliography ol prol ability 
nequalities of the Techebvcheff tvpe, with a few 
additions and revisions necessitated by the passagt 
of time and the author’s increased knowledge of the 
ibject 


2. Scope and Organization 


Tchebychefl inequalities vive bounds for the prob- 
ibility of certain events in particular they give 


\ 


as well as the 
Sufficient detail is given for each inequality so that the material 
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een basic inequalities relating tail area probabilities to moments are stated. One 


classical two-tailed, unidimensional 


or familiarity with the entire article 


itive trengths of the 


Inequat- 
included 


estimates for deviations from the mean in terms of 
the moments. 

A selected collection of Tchebychefl inequalities is 
given. They have been for their diverse 
nature and for thei in applied and 
theoretical work 

In section Ll the various inequaliti sare presented 
with their and the conditions 
under which they mieay In several Cc 
more than one form of the equality Is prest nted in 
e inequal Ly 
random Vari 


selected 
usefulness 


some notes on uses 


be used ises 
ordet ton ake it easier to work with tl 
With each inequality the nature of the 
able is specified ; that is, it is indicated whether the 


random variable is arbitrary, uv Sul or some other 
function Other special conditions are stated, al 
ways including the dimension of the random vari 


ables, and the moments which are assumed to exist 

In section II], tables are given for finding the 
particular probabilities associated with a specific 
inequality Tables ure also given fol determining 





the required size of the variable parameters when a 
particular probability is desired. These tables will 
be found useful in choosing which of the unc qualities 
to use 
the inequalities which involve several parameters. 

Several eXall ples are section L\ showing 
how some of these inequalities can bye used The sec 
examples show Various possible uses but ure by ho 


Tables have not been prepared for some of 


riven in 


means exhaustive 
The bibliography in section \ 


is 


as comple ie «as 


possible The books by Uspensky 1937 and 
Fréchet 1950), and the papel by Godwin (1955) ar 
recommended as good surveys of the subje et 
Techebvchefl inequalities are useful for working 
with distributions whose functional form is un 
known. In many situations it is possible to avoid 
the assumption that random variables are Sa \ 


normally distributed. All that is needed for use of 


these inequalities are rood estimates ot certain popu 
is required 


lation moments Sometimes somet! 


of the functional form of the density function of a 
distribution. This is true for inequalities 9, 10, and 
lla in the text However, it is easv to verify 








whether the necessary conditions are true for the 
distributions that one is discussing. 

In statistical work these inequalities have had 
several uses. In working theoretical problems, it is 
often necessary to use these inequalities, for in- 
stance, in proving the weak law of large numbers for 
binomial distributions. These inequalities are par- 
ticularly useful for testing hypotheses and finding 
confidence intervals for the mean of a distribution, 
if one has some information about the other moments. 
In industrial work, these inequalities have been used 
to form ‘“‘tolerance”’ sets 

Usually one does not have the true values for all 
of the parameters that are needed for using these 
inequalities. But if one has upper bounds for the 
parameters, that is for the moments, then one can 
use these inequalities. If one has run a process many 
times with the type of material, then one 
usually has a good idea of the variance, even if the 
process mean has been shifted, so that in a sense one 
often knows some of the moments, and in this way 
one can test for the others . 


Ssaimne 


Remember that a sample acts as a population ; 
therefore, once the moments have been computed 
for a sample, all of these inequalities will be true for 
that sample; that is, these inequalities will provide 
bounds for the portion of the sample in various parts 
of its range To obtain lower bounds for the prob- 
abilities in the tails of the distribution it is usually 
necessary to assume that the random variables are 
bounded. 

Most of the inequalities presented are for the 
univariate There are several papers that 
discuss the multivariate case in much more detail; 
in particular, see Camp (1948), Leser (1942), Pear- 
son (1919), and Marshall and Olkin (1960a). Most 
of the multivariate inequalities have been omitted 
becuuse they are quite complicated and hard to 
apply. For each inequality presented here, the 
dimension of the random variable is specified, and 
this is a clue to deciding which one of the inequali- 
ties is applicable to a specific problem 


case 


Several of the inequalities given require special 
assumptions on the shape of the involved distribu- 
tions. All special assumptions require 
that the distribution has an unique mode. Narumi 
(1923) treated the opposite case, where the distribu- 
tions have an unique minimum and increase as you 
go away from it. This case did not seem to be as 
important as the other and is omitted. 


of these 


Winsten (1946) found inequalities that involve 
the ranges for various sample sizes. These inequali- 
ties will undoubtedly prove useful in the future; 


but they are not entirely analogous to the Tcheby- 
cheff inequalities and were omitted. 

The Markov Inequality I° contains many of the 
other ones as special cases, which is a little surprising 
since this is the simplest of all the inequalities 


These 
lo not 


* Each inequality has been given a name, mostly for convenience 
names do not necessarily reflect priority It is hoped that the name 
conflict with common usage 


This results from the fact that lis true for any positive 
random variable, XY, that has a finite expected value 
In particular, la is derived from 1 by replacing 
by a sum of random variables. Inequality 2 is 
obtained by replacing X by the square of the dif- 
ference between a random variable and its expected 
value. One can derive many of the other inequali- 
ties in this manner 

In cases where the inequality is given only for the 
random variable .Y minus its mean, there are also 
inequalities for a sample average minus the expected 
value of that average. 

Most of the inequalities are stated in the form of 
upper bounds for the probability that a random 
variable is greater than or equal to some number 
There are opposite inequalities, lower bounds for the 
probability that the random variable is less than the 
same number. These are the expressions 
with the inequality reversed within the probability 
symbol (the “greater than or equal’? symbol being 
replaced by a “less than”’ symbol , and with the 
right-hand side replaced by one minus the original 


same 


right-hand side 

As given, some of the inequalities are very weak, 
for the right-hand sides may be greater than unity; 
but a probability is always less than or equal to 
unity, so the right-hand sides should be interpreted 
as the minimum of the given expression and unity. 

Most of these inequalities cannot be improved; 
that is, the right-hand sides cannot be replaced by 
smaller quantities. That is, usually the left-hand 
side equals the right-hand side for some distribution 
that satisfies the conditions under which the in- 
equality holds. Of course this will only occur for 
certain exceptional cases. If the exceptional case 
is known to be impossible, there might be a better 
inequality available [see Godwin (1955)] 

In the following inequalities, unless otherwise 
noted, \ is any positive number. LX equals the 
expected value of the random variable, and will be 
denoted by yu; if need be this will be given a subscript 
The expectation sign iD will be used to denote other 
expected values depending on the argument that 
follows it. That is, “(.Y—y)*? will be the variance 
and will be denoted by o The S\ mbol P A) means 
the probability of the event A 


It gives me great pleasure to thank the staff of the 
Statistical Engineering Laboratory of the National 
Bureau of Standards for their encouragement during 
the preparation of this manuscript. I am particu- 
larly indebted to Joan R. Rosenblatt for her careful 
reading and detailed comments on the penultimate 
manuscript and to Lola 5. Deming for her skillful 
management in revising this manuscript for publi 
cation. Professor Ingram Olkin of the University 
of Minnesota has been a guide in removing 
scurisms, in suggesting crucial changes to bring the 
original manuscript up to date, and in the prepara- 
tion of the final bibliography 
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II. Inequalities Refe rences: Cramér (1946), Fréchet (1937) 


1. Markov Notes: 
d 1. This inequality is formally the same as 1, but 
Pr XxX, n)<1/r 1) | Shows how I can be used where the random variable 
— of interest is actually the sum of several random 


variables. 


P(X>)X) <w/A 1’) : 
2. It is clear how this inequality is derived from 


y the first one, for it is the same as that one. xcept 
° that the random variable can be written in two ways, 
that is, either as Y or as a sum of Y 


Random variable 


Restrictions: .Y is non-negative, that is, PLY<0)=0 
Dimension: One 
Moments: uw=LX 


2. Bienaymeé-Tchebycheff 
References: Cramér (1946), Fréchet (1937). 


4 7 ) 

Votes: / \ v1 do LA 9 

1. This is a fundamental inequality from which P(X>do+p or X<p—do 1/r 9! 
inequalities la, 2, 2a, 2b, 3, 5, 6, 7, 8, 13, and 13a 

can be derived by picking X as a special function of P(|X—p) >) <o?/d 2’ 


the random variables of interest 


Random \ ariable: 4 


2. In using this inequality, note that one needs 

aie | Restric None 

to know only one moment, or one Is testing an hy- restrictions: None 
pothesis about only one moment Dimension: One 


3. By itself this is a rather weak inequality, for | Moments: o2=E(.Y—,): 
the probability is bounded by 1/A; this is of course 
to be expected, since only one moment is being used | References: Cramér (1946), Fréchet (1937), Uspensky 


and therefore one has very little knowledge about (1937 
the involved distribution, or at least is only using 
very little of this knowledge Notes: 
4. Equality holds in 1, with A>1, for the random 1. This is the standard Tchebycheff inequality for 
variable Y which takes only the values zero and pX, one random variable. 
with probabilities (A—1)/A and 1/A, respectively ; 
, , , 2. Now the ‘obal S creasing as , 
If \<1, the left-hand side of 1 is unity for the de- — ie probability is decreasing as 1/0’, 
eh i which means that the probability of large deviations 
generate random variable which is equal to uw with . 
eee , | from the mean becomes quite small. It is to be 
probability one. A similar argument shows that I 
, noted that for the normal distribution and for large 


is also a sharp inequality 
tn ' this Is actually a very poor estimate of the proba- 


la. Markov bility of large deviations, for there the probability of 

a large deviation is smaller than e ”, but for 

Y intermediate values this is not a bad approximation. 
P(=>nr)<1/ la 

u 3. If one has a fairly good estimate of ¢, then this 

‘ inequality can be used for testing hypotheses about 

P | > As] > ite >»): 1/r la’ the mean, and for finding confidence intervals for 

i= i=i the mean. In many industrial applications this 

inequality is used for estimating how much of the 

Random variable: production will be near the mean of the process, 

Y¥—-s XY where one has a good idea of the variance 


, : 2a. Bienaymeé-Tchebycheff 
Restrictions: Each .X; is non-negative ¥ teen 


Dimension: Each .X, is one-dimensional, but actu- P(yn|X—p| > do) <1/d (2a) 
ally the X, may be considered as one observation ie ; : a 
on an n-dimensional random variable; i.e., the X, P(|X—p| >) So*/nd (2a’) 
may be dependent. 


zs 1 & ~; 
mm | Random variable: .\ > Ay 
Moments: »,—FEX, | n i=! 
| 
a Restrictions: If 1#j, then X, and X, are uncorre- 
a | lated; that is, ELXY,—EX,)(X,—EX,)=0 
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Dimension: One 


Moments: u=FEX, 
a I Ae M 4 
References: Cramér (1946), Fréchet (1937), Uspensky 
(1937). 
Note a. 


1. This is one of the most useful of the Tcheby- 
cheff inequalities. One can use this whenever he 
has sample averages of identically and independently 
distributed random vuriables. 

2. This inequality gives the “square root of the 
That is, |X—uy! is of the order of 


magnitude 1/yn in probability. 


sample size”’ law. 


3. The uses of this inequality are much like those 
f 2, but it can be used when working with sample 


averages. 


2b. Bienaymé-Tchebycheff 


P (\Y—pi do) < 1/ (2b) 
s 8 
Oy N<SS Dh’ 
/ X iv > > 29 A 2b 
' i 
" 
Random variable: Y=>) A 
i=l 
Restrictions: None 
Dimension: 7 


Moments: My LAs M » > My 


0 i At M X, My) 
‘ " 
a — p 1 
t=] j=] 


he ference Uspensky (1937 
Notes: 

l This Is another form of the Tchebychetf lli- 
In this case the random variables forming 


equality 
different variances and can 


the sum 
be correlated. 


can have 


2. In the future, inequalities are presented only 


for samples of one observation. 





The detailed treat- | 


ments of the versions of 1 and 2 for sample sums 
and sample averages can be modified to apply to 
the other inequalities. 

3. Another point of interest is that, although the 
inequalities appear to require exact estimates of 
certain moments before they can be used, it is pos- 
sible to get similar inequalities by substituting upper 
bounds for the moments that are involved. For 
instance, in this inequality one might not actually 
know the value of o, but from previous experience 
he might know that it can not under any circum- 
stances be greater than, say, o’. In this case, if one 
uses o’ instead of ¢, then the inequality will not be 
as good as the given one, but still may prove useful 
This technique can, with some care, be used for all 
the inequalities 

3. Pearson 


P( * “> )<1/n" 
P (.X—p! >r)<B,/r" } 

X—up B, ; 
P( = p >r)< ) 


Random variable: Y 


Restrictions: None 
Dimension: One 
Moments: yu LA 
ef=—h(X—p 
3,=Fh\X— yp!’ 
References: Fréchet (1937), Narumi (1923 , Pearson 
1919 
Notes: 


1. In order to use this inequality, an absolut: 
moment of the random variable Is required. 

2. If several absolute moments are available and 
one needs an inequality for a particular A, then us« 
that moment that makes the right-hand side of 3’ the 
smallest for that particular A. Thus, for instance 
3’’ should be compared to 2: for large values of X it 
may provide a smaller bound 


4. Birnbaum, Raymond, and Zuckerman 


If n is an even integer, 


‘a l if M< ne 
. no f “ 2 lid - 
4 , ° > il no | a (3 ) 
P( 2a (Xi My) d* ): 4 2* lio a . 
no no*\. .. no 
l f (3+ y5)<X 
L ( me) if wre 


214 








us 
to 


of 


DS- 


OI 


lly 


Ill- 
ne 
be 
ul 
all 


On 


If n is an odd integer, 





g l if A“ < ho- 
n = - iW no*?<d< ; (3n--1 yon 67 5 
P( >> (Xi—4,)” rv? )< 4 27—(n—1)o? } 
i=] . ” 
no- \/ é l 0 0 : . 
. if »/ l yon on ) \ 
L iw 4 


Moments: by EX, 


Random variable: >) (X,;—yz 


Restrictions: The X,'s are independently distributed of = KE Xi—u, 
with common variance. 


Dimension: 
Moments ld EX, 


7 Kk a by 


References: Berge 1938), Lal 1955), Olkin and 


, . j ‘ 4 ! ‘ / ‘kermis ~ , p 
Re erence. Birnbaum, Raymond, and Zuckerman Pratt (1958). Whittle (1958b 
194% 
Notes: Votes 
mr . } . = , rye ! 1 
1. This inequality is an upper bound on the prob |. This inequality bounds the probability of falling 


ability of the sample pomt falling outside of a hy per- 
sphere centered at the population mean. Birnbaum, 


Ravmond, and Zu kerman 1947 also vives bounds 


outside of a rectangle centered at the means tor a 
bivariate sampl 


for hyper-ellipses 2. This inequality uses the dependence between 
2? The application of this nequality to bombing the random variables, and therefore, in order to 

and other aiming problems Is obvious. apply this ine quality one needs a tuallv to have 
3. This inequality is multidimensional in that the | some knowledge about the correlation. 

probability of a multidimensional set is bounded — ., 

ry. ) The right-hand side of this inequality is a 

Che random variables, however, are assumed to be Poss : , ; et * | 

- aecreasing inction oO Le rrelation s ’ 
independent. Inequality 5 is multidimensional in a ee nus, tnt 


most impressive results are obtaine eC] ‘ 
both senses, @€.2., the probability ol falling na rec- , shits are OD uned wh . the COr 


relation is one, Even when the rar m variables 
tangle is bo inded and Lie random variabl Ss are in- , ilo iriable 
— are independent (zero correlation) the right-hand 
( eCTrLGenL. | P " 1 
“4 pop a ne ults { ld and even | "He 18 2/4° which is not quite as strong as could be 

Ss ) recomes Taree neresu Sior oat maha ¢ el ifr . . 
: . obtained irom 2 but still useful 

integers 7 approach each othe! 


$. Marshall and Olkin (1960a) found a “‘one-sided’ 
5. Berge version of this inequality and its 


clime nsions 


eX sion to p 


P ( either — . >A Or — — >): 


6. Guttman 


; 
\ 1 
P(X u, >A, o, or LX bo > oo 
: } X 5 2 A 
\ \ p-AsA5 I ' = : 
2dr . , | Random variable: A 
; / 
Random variable: Ai, A . : ; - ~~ 
Restrictions: Ae are identically and independently 
Restrictions: None distributed 
Dimension: Two Dimension: One 
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Moments: 7 Ae 


a? = fy Xi u)? 


and define 


] " _— 
‘2 . 4 "\2 
Hun- > (X,—X) 


References: Guttman (1948b), Midzuno (1950). 


Note: 

Inequality 6 is applicable whenever 2a can be 
used, however, 6 takes more computing than 2a 
does, and that is its only disadvantage. If we com- 
pare these two inequalities, they differ only in the 
quantity on the right of the inequality sign within 
the probability statement. For 6 that quantity is 
a random variable, and it is not for 2a. The ex- 
pected size of this random variable is much smaller 
than that of the quantity that occurs in 2a, and 


therefore with probability near one it is a better 
inequality to use than 2a 
7. Kolmogorov 
P(T> od) <1) 7 
Random variable: S,;= >) (Y,—yu,);i=1,2,...,n 
l 
Let 7=max[|S;,|,..., |S,|,...|S, 
Restrictions: .Y, and .Y, independent 
Dimension: One 
Moments: 4,—/.X, 
o* pb E(X,—p,)? 
i=] 
References: Fréchet (1937), Kolmogoroff (1928), 


Marshall (1960), Uspensky (1937). 


Note: 

This inequality can be used whenever 2a is appli- 
cable. A typical use of this inequality is in the 
extreme-value situations. For instance, if one 
putting together an assembly one might ask what 
is the probability that the cumulative error ever 
exceeds a certain quantity, and this inequality would 
give the answer 


is 


8. Cantelli 
. o ; 
P(X—psn): ey if <0 (8) 
os 
> 1-=>> if A>0 
Random variable: .Y 


Restrictions: None 


| 
| 





Dimension: One 


Moments: »=EX 
a? = K(X— yp)? 
References: Cantelli (1928), Cramér (1946), Uspensky 
(1937) 
Notes: 
1. This inequality is applicable whenever in- 


equality 2a can be used 

2. In this case one is interested in one-sided al- 
ternatives; that is. one wishes to detect large posi- 
tive deviations from the mean. This occurs, for 
instance, whenever one is using one-sided confidence 
intervals or one-sided test regions 

3. The derivation of this inequality essentially 
depends on the Schwartz inequality. 

+. Comparisons of “‘‘one-sided”’ and ‘“‘two-sided”’ 
alternatives for certain convex sets are obtained by 
Marshall and Olkin (1960b) : 


9. Gauss (Camp-Meidell) 


‘a d 2 ae 
|- for \< 1.1547 
V3 /3 
et M >XT 
4 
for Y) 
9d? V3 
: 41+s? ., 
) " ’ a 
PU\X—p Ao sa A a HA Q 
Random variable: Y 
Restrictions: Y has a density function with one 
mode, vi 
Dimension: One 
Moments: bu EX 
of=h(X—yz)? 
r*=0° + (u—p 
up 
o 
References: Cramér (1946), Fréchet 1937 Gauss 
ISSO), Narumi (1923). 
Notes: 


1. This inequality requires the same knowedge of 
moments as does inequality 2; but it is also necessary 
to know the mode of the distribution. For a sym- 
metric distribution, of course, the mode is the same 
as the mean. 

2. For asymmetric unimodal distribution, this is a 


better inequality than 2, since the bound in 2 is here 


multiplied by 4/9. The inequality 9’ has a par- 
ticularly simple form when s=0. Indeed, for any 
unimodal distribution such that a bound for s is 


known, this inequality is better than 2 for suf- 
ficiently large values of X. 

3. If this decreasing property actually is true, 
then this is a better inequality to use than 2, for it 
essentially multiplies the bound by 4/9. 

4. This inequality in the form 9 is a special case of 
inequality 10’. 
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10. Narumi, Gauss (Camp-Meidell) 





F / l ] ; - > rb 
s! at pr if OSAS- +] 
] rb - = | 
P(\X—py'>Av1 ) < >; if = ;SAsb 10) | 
om 
c= if b<X 
ae | 
n - 
<1 if A< 
l+r) ( r) 
P(| X—p)| > dyvr,) (10’) 
7 ? 7 
[: yO) if a N 


Random variable: X 

Restrictions: Y has a density function f(x), and 
f(z) has an unique maximum in the interval 
(uo—b, potb) at w and b6>0. Use (10) if } is 
finite, and (10’) if 6 is infinite. 

Dimension: One 

BE X—4p 


Fréchet (1937), 


Moments: (y 


References: Narumi (1923 


Votes: 

|. In 10 and 10’ the absolute moments about the 
mode are used rather than the absolute moments 
about the mean. For unimodal symmetric distribu- 
tions that is not a restriction 

2. For many of the common distributions 6 will 
be infinity and 10’ can be used in preference to 10 
In applications, however, one might only be sure of 
the behavior of the density function near the mode 
and thus 10 is required 

3. Actually, in most cases one would probably use 
inequality 9, which is a special case of these in- | 
equalities; this is why it has been given by itself 


ll. Peek 


. 1—é? 
4 4 > < ’ X > 6 { ) 
BX wl 2a) Sy gp’ 82? . 
Random variable: .Y 
None 


Dimension: One 


Moments mM EX 
oa BR(LX 
FIX 


Restrictions: 


—~ 
i= 


v 4 
6 Vio | 


Reference : Peek (1933 


Notes: 

|. This inequality is much like inequality 2, except 
that here one needs to know v, the mean deviation. 
If one has this additional information, this may be a | 
better inequality to use. 
2. This is a special case of inequality 12. 


lla. Peek 


. 4 
P(|\X—p|>do) < (lla) 


—9 (A—6)? 
Random variable: Y 


Restrictions: Y has a density function whose only 
mode is its mean 


Dimension: One 


Moments: p=EX 


o* = h(X—p)’ 
v E X M 
6=v/a 


Reference: Peek (1933). 


Note: 

This is an improvement over 9 for relatively large 
\, but can only be used if one has an estimate of the 
mean deviation. For a particular value of \, this 
should be compared with 9 and with 12 


12. Cantelli 


. p - p 
P(X >r)< if V< 
me ’ ad 
De B p \" 12 
- - j 2 
(X 8,)*-- Bo, — B= re} 


Random variable: P 4 
None 
Dimension: One 
Moments: bu BX 


Restrictions 


Bo, = E|.X—p|*" 
p Ee ba rv J 
References: Cantelli (1910), Fréchet 1937), Peek 
(1933) 
Notes: 


1. Of course in this case one needs information 
about two moments. 

2. The first part of 12 is equivalent to 3’; for 
larger values of \, 12 is better than 3’ when the 
required two moments are known. 


13. Bernstein 
P(|\X—p 
Random variable: .Y=3S.Y 


X, and XV, are independent 


h y-+20% 


>A) <2e sails 13 


Restrictions 


Dimension: One 


Moments: p,—ENX,, uly 
o I (Ae be o wu 
’ /e—-2 
i _ o,s! ¢ ‘ 
Dp Ae Mil’ S > for all integers s and 


some constant C'(C' >0). 


Curtiss (1950). Fréchet 


Uspensky (1937). 


References: Craig (1933), 


(1937), 
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Notes: 

1. This is a very nice inequality in the sense that 
the bound goes to zero very rapidly as \ increases. 

2. The one difficulty in applying this inequality is 
that essentially one has to know at least an upper 
bound for every moment of each distribution in- 
volved. The next inequality treats a useful par- 
ticular case. 


13a. Bernstein 


P(|X—p|>)<2e™™ av*+§ ma) (13a) 
Random variable: Y= ZX, 
Restrictions: P(|X,—p,;| >m)=0, X; and XX, are in- 


The maximum deviation of a 


dependent (i #)). 
X, from its mean yp, will not 


random variable 


exceed m, with probability one. 
Dimension: One 
Moments; u,—EX, p= Dy 
o2=E(X,—up,)’, of te; 


References: Fréchet (1937), Uspensky (1937). 


Note: 

Each YY, has a distribution which does not allow 
deviations greater than m from y,. This condition 
also bounds all of the central moments. 


13b. Bernstein 
P(X 


ul>or\)<e*”? (13b) 


A=ZX, 


NV, are independent (i#)); X, is 
-ie., A(X,—p,)"t'=0 (r=—0, 


Random variable 


Restrictions: .X,, 
symmetrical about u 
> P .) 

Dimension: One 


BEX, p 


Moments: yg, Du 


2.7 (Op\t 
1 w wn 2h (ar). : : 
E(x, ws)? <( - ) - (r an integer) 
of= K(X ,—p,)?, c? =2o07 


Reference: Uspensky (1937). 


Note: 

This form of the Bernstein inequality places many 
restrictions on the underlying distributions through 
their moments. The normal distribution does satisfy 
these conditions. 


Ill. Tables 


Essentially tables 1 and 2 are inverses of each 
other, column by column. The first table answers 


the question: How large is the probability associated 


with a specific “deviation” \? The second table 
gives the ‘deviation’ associated with specific 


probabilities P. 
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P(X u>r)<0.10, 


TABLE | Probability associated with deviation X 
Probability l 2 3 4 5 6 7 
associated 
with 
l 1 1 4 4\‘1 l 
. A r Dr (zp 1+? Ses 
1.0 1.000 1.000 1. 000 0. 444 0. 4096 0. 5000 0. 
1 0. 667 0.444 0. 1975+ lus Os09 3077 325 
2.0 SOO 25) O625 lll 0256 2000 135 
2. 5 4100 1 0256 O71 0105 1379 O44 
3.0 333 lll O123 O49 0050 1000 oll 
; pant OR 00687 036 0027 O755 002 
io 2%) on 0039 O28 0016 O5SSS8 ooo 
4 222 O49 oo24 022 oolo 471 
0 20 040 oo16 OS OOO07 O3S85 
182 033 ool Os 0005 0320 
4.0 167 (2s 0008 012 0003 0270 iu 
6.5 14 024 0006 oll ooo2 0231 000 
TABLE 2 Deviation associated with probability l 
Deviation l 2 3 ‘ 5 6 7 
associated 
l l 1 2 4 .. i 
I ‘ ‘ | 2k 
I VP I 3yP Sf Vp-1'¥ i 
0.99 1. owl 1.0050 1. 002s 0. 6700 0. 802 0. 1005 0. 1418 
ie 1.0526 1.0260 1.0129 6840 S10 293 3202 
m) 1.1111 1. 0541 1. 0267 7027 821 $333 4590 
7 1.3333 | 1. 1547 1. 07 4 THUS seit 5773 7585 
“1 2 1.4142 | 1. 1882 ¥428 951 1 1.1774 
2 4 1.4142 1.3333 1.131 1. 7321 1. 651 
10 10 3.1623 1.7783 1082 (1.423 | 3 2. 1460 
0 v1) 4.4721 2.1147 14 1.602 4. 3589 2.4477 
ol luo 10 3. 1623 th, Oe 2. 530 ¥. 4449 +. 0348 
0O1 1000 31.6228 5.6234 21.0818 |4. 500 (31. 6070 3. 7169 


An example is now given showing how to use these 


tables. Suppose one has a sample of nine inde- 
pendent observations from a distribution whose 
variance is four units squared. (1) What is an 


upper bound for the probability that the sample 
average is more than one unit larger than the popu- 
lation mean? (2) How far above the population 
mean could the ninety percent point of the distri- 
bution of sample means be? (3) If one knew the 
population had an unique mode at its mean, could 
these results be improved? Answer: For questions 
(1) and (2) one needs inequality 8, and for (3) 
needs 9’. 

To answer question (1), determine an upper bound 
for P(X—yp2 1) using the second part of 8, since 
A=1>0. Since 8 is given for a random variable Y 
with variance ¢X, and the present question involves 
the average of n=9 observations from a distribution 
4/9, in 


one 


with ox—4, we use the variance of LX, oF 


inequality 8. Thus, 


P(X—p2 


The value of this bound is found in column (6) of 
table 1 for A=3/2=1.5, which shows 1/(1+ .?) 
0.3077. 
To answer question (2), determine the smallest 
value of A for which 





again using the second part of 8, The required value 


of X satisfies 


. ox A 0.10 
o=+t * 
or 
0.10. 
1+-A?/a=% 


The smallest value of \/oz which satisfies inequality 
8 is found in column (6) of table 2 for P=0.10, which 
shows A/ox=3, whence \=3e7=—2 

To answer question 3), the two previous questions 
are treated again using inequality 9’. This makes 
of the additional information that the distribu- 
tion is unimodal with its mode yp, at the mean (s=0 
in 9’ On the other hand, 9’ two-sided in- 
equality The first step to evaluate upper 
bound for 


use 
Is a 


an 


Is 


P(X—p2 1)<P(\N<y!> 1 

ising 9’ with Acozg—1, that is with A=3/2. This is 

found in column (4) of table 1, £/9X 
0.198 This is a smaller uppel bound than the one 


obtained from 8 


W hich shows 


The second part ol 4 is to find the smallest value 
of A ior whicl 
PUN<p \ 0.10 
his value is given by 
4 l 
0O.10 
Q X IT 
and from column 1) of table 2 for P=0.10 is 
found to be A tx—2.1082 or A 1.41 This is better 
smaller than the value obtained by use of 8 


Thus for this problem it seems better to use in- 
equality 9’ than 8 

These tables will facilitate choosing which inequal- 
ty to use when several are available, by comparing 
the associated probabilities deviations) with the de- 
viation (probability) of interest, thus making it pos- 
sible to choose the inequality that gives the smallest 
for the problem at hand 


probability (deviation 


The columns of the tables are associated with the 


! equalities as follows 


Tneq 
1), (la la’ 
2), (2’ 2a). (2b), (6 7 
3 with r=—4 
9 
10’ for r=4 
8 for o | ) 


13 for 2a 2cA=2: 13a for 26 -mMrA=2Z2 13b 


IV. Examples 


iam ple ] Assuming that all soldiers are 
vtween 60 and 78 inches tall, what is the probability 


hat the average height of 500 soldiers is more than 





1 inch away from the average height of all soldiers? 
Solution: Although the population is finite, it 
safe to assume that the measurements in the sample 
are independent. The largest possible variance 
occurs if half the soldiers have height 60 inches, and 
half have height 78 inches, in which case the vari- 
First apply inequality 


Is 


ance is 81 inches squared. 


2a’. Here A=1 inch, o?=81 inches squared, and 
" an o* S1 : - 
n=o00 Thus the answer is === 0.162 One 
nr* 5OO 
can also apply inequality 13a. Here \=500 inches, 
a 500 81 inches, and m IS inches The prob- 


ability is 0.11, and thus for this example inequality 
13 gives more precise results than 2 
In the course ol deposit and with- 


money inh a 


I a m ple 
drawal transactions, such 
radioactive material in a hospital, one often wishes 
That is 


bank, or 


as 


to control the absolute error In a sequence 


of, sav, 100 transactions (a day’s activity) one does 
not want one’s books to differ from one’s assets at 
any time, by more than some fixed amount, say 
1,000 units Assume that the variance due to 
errors Of measuring and ol counting tor eas ly trans- 


action is 400 units squared this value be Ing obtained 
experience The 
what js an upper bound for the prob 
than 


by previous question then nat 


urally arises 
ability of having an accumulated error ¢ 
1,000 units at anv time during the day? 


nore 


Solution Inequality 7 is suited ior this problem 
Here } LOO %() and yor 1 (00) TI us 
L.OO0 L.OOoO . ; 
) 5,and 1/X\ 1/25=—0.04 If instead 
10-20 200 
of 100 transactions there had been 400, then » 100) 
1.000 1.000 . 
g=20, ynorX=1,000, Xd 2.5; and the 
20-20 100) 
l 
resulting probabilty is at most ( - ) -==(.161 
Z.0 H.25 
Example (3 From previous experience assum 
that the correlation between two variables height 
and weight, rainfall and crop vield) is at least O.S8 
If a sample of 25 is made on this bivariate distri- 


the deviations 
eN\Ct eded 


bution, what upper bound for 


from the population means that will not be 


an 


more than 10 percent of the time. where deviations 
are measured in standard units? 
Solution Here one can use inequalit 5 First one 


must solve the following equation 


0.10 | \! 0.64 | 0.10 yU.o8 
A 
] 0.6 1.6 
d 16: A } 
0.1 0.1] 
Thus 
P ( —">4 or 2—"'34)<0.1 
or 
r—p\. 4 yY—p _4 
, or <().] 
Pl, 25% «, <5) 
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Thus both sample means will be within 0.80 
standard units of their respective population means 
with probability at least 0.9. 
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